CUE PROBABILISM AND INFERENCE BEHAVIOR


LEE ROY BEACH 
Office of Naval Research, Washington, D.C. 


This is an investigation of how Ss use probabilistic cues to make inferences 
about objects’ (Os’) class memberships. Os having specified probabilistic cue 
properties were presented and Ss’ inferences were compared to the theo- 
retical predictions. It was concluded that: (a) for unfamiliar Os cue proba- 
bilism is important in class inferences; (b) for familiar Os inferences are 
based upon recognition; (c) for unfamiliar Os which are similar to familiar 
Os the inference is the class appropriate to the familiar O; (d) Ss who 
rely upon recognition also rely upon the similarity between familiar and 
unfamiliar Os in order to make inferences for unfamiliar Os. 


BRUNSWIK’S (1956) theory of proba-
bilistic functionalism presents not
only a point from which to view the be- 
havior of men in their environment, but also 
provides a framework within which further 
theorizing can take place. The present 
work is an attempt to use this theory as 
a point of departure for an investigation of 
how subjects (Ss) make inferential judg- 
ments about the identity of objects (Os) 
encountered in the environment. 
Probabilistic functionalism is a theory 
of perception and cognition. It views Ss 
as operating in a complex environment 
in which cues associated with Os are not 
perfectly correlated with the attribute be-
ing inferred. For example, height correlates
approximately .10 with IQ and is there-
fore a poor cue upon which to base in-
ferences about IQ. Vertical position in the
visual field, however, correlates .60 with an
O’s distance from S and thus is a moder-
ately good cue for inferring distance
(Brunswik, 1956).

1 Opinions or conclusions contained in this report
The author wishes to express his sincere appre-
ciation to R. K. Hammond, M. Wertheimer, J. R.
Berkshire, Alice Garman, and Evelyn Helvey for

The equivocal relationship between cues 
and Os, called cue probabilism, requires 
that Ss learn to utilize optimally the various 
kinds of cues. Correlational analyses of 
inference behavior show that the correla- 
tions between each of the different kinds of 
cues and Os are paralleled by the correla- 
tions between Ss’ inferences and the cues. 
Moreover, the validity of Ss’ inferences 
about Os (also measured by correlation) 
is higher than the correlations between 
any one of the different kinds of cues and 
Os (Brunswik, 1956; Rappoport, 1961; 
Smedslund, 1955; Summers, 1962). These 
findings indicate that Ss learn to utilize 
probabilistic cues and that by combining the 
information from multiple cues they can 
make accurate inferences about Os. 

Because the theory of probabilistic func- 
tionalism is stated in terms of correlation 
it is not surprising that the related research 
has relied upon this analytic technique. 
The advantage in the use of correlation 
is, of course, that it permits succinct state-
ments about Ss’ use of cues for inferences
about a group of Os or for repeated in- 
ferences about O. The disadvantage is, 
however, that the unique event—a single 
inference for one O—is glossed over in 
favor of the more general correlational 
statement. It would be valuable to have 
an alternative theoretical and analytical 
approach to inference which would permit 
the examination of the inferences made for 
each O as a unique event. The present 
work attempts to provide such an approach. 


Nature of the Environment 


Cues. The first assumption is that the 
immediately apparent, palpable character- 
istics of an O are the bases for inferences 
about its covert, impalpable attributes. For 
example, an O’s size, surface texture, weight, 
color, and shape all supply information 
about what kind of O it is—whether O 
is a house, a cup, a peach, etc. The imme- 
diately apparent characteristics of Os are 
called cues. 

Cue Dimensions. If Os are compared 
for some characteristic, such as weight, it 
is clear that different Os possess the char- 
acteristic to different degrees. What is more, 
if a large number of Os are compared on 
this characteristic, it is possible to order 
them in some manner. The metric created 
by the ordering of Os according to their 
possession of an attribute is called a cue 
dimension. The value on a cue dimension 
which corresponds to the degree to which an 
O possesses the related characteristic is 
called O’s cue value. (Dichotomous cue 
dimensions are considered as special cases 
of the more general multivalued dimension.) 

Covert Attributes. Os’ cue values lie at 
one pole of the inference process. At the 
other pole is the goal of the inference—the 
covert, not immediately apparent char- 
acteristics of O. The covert attributes are 
of two kinds, O’s identity, its class mem- 
bership, and other attributes of O which S 
does not know, its covert cue values.2 The 
covert attribute with which the present
work is concerned is O’s class membership.
The inference process goes from the (overt)
cues associated with O to the identification
of the (covert) class to which O belongs.
In this situation S must rely on the cues,
make an inference about the class, and
finally receive information of some kind
about the accuracy of his inference.

2 While for the purposes of the present discus-
sion it can be assumed that fundamentally there is
no difference between the cues and the covert at-
tributes, the fact that cues are observable at the
time the inference is made while the covert at-
tributes of O are not, permits a psychologically
meaningful distinction between them.

Many definitions of the term class have 
been used in previous research (Bruner, 
Goodnow, & Austin, 1956; Sarbin, Taft, & 
Bailey, 1960). The definition to be used here
is that a class is a name assigned to two 
or more Os. This name is used to identify 
Os, and its use is usually dictated by a 
source external to S. Indeed, the process 
of a child learning about Os in his world 
consists in part of forming links between 
different cues and the names which elders 
indicate to be “correct.” 

Probabilism of Cues. The process of in- 
ferring an O’s class from its cue values is 
complicated by the fact that a given cue 
value has usually been associated with 
a number of different classes of Os. Con- 
sequently, knowledge of this value does not 
allow an unequivocal inference about O’s 
class membership. This equivocality, which 
is inherent in the use of cues as the bases 
of inferences, is called cue probabilism. 

Cue Probabilism and Classes. In the 
environment every O possesses cue values 
and also belongs to a class. Theoretically, 
for each cue value on each cue dimension 
it is possible to construct a distribution of 
the relative frequency of association of the 
cue value with various classes of Os. If it 
is assumed that events which have occurred 
with the greatest relative frequency in the 
past are the events most likely to occur in 
the future, this distribution can be used to 
make inferences about the class membership 
of Os. Given a cue value associated with an 
O, the class having the greatest relative 
frequency of association with the cue value 
is most likely to be the correct class for O. 

The relative frequency distributions for 
cue-class associations are called cue dis- 
tributions and are the keystone of the pres- 
ent approach to inference. 
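The construction of a cue distribution can be illustrated with a minimal sketch. The function names (`cue_distributions`, `most_likely_class`) and the observation history are hypothetical and serve only to show how relative frequencies of cue-class association might be tabulated for one cue dimension; they are not taken from the experiments reported here.

```python
from collections import Counter, defaultdict


def cue_distributions(observations):
    """Tabulate relative frequencies of class membership for each cue value.

    `observations` is an iterable of (cue_value, class_name) pairs on a
    single cue dimension. Returns {cue_value: {class_name: proportion}}.
    """
    counts = defaultdict(Counter)
    for cue_value, class_name in observations:
        counts[cue_value][class_name] += 1

    distributions = {}
    for cue_value, class_counts in counts.items():
        total = sum(class_counts.values())
        distributions[cue_value] = {
            name: n / total for name, n in class_counts.items()
        }
    return distributions


def most_likely_class(distribution):
    """Return the class with the greatest relative frequency."""
    return max(distribution, key=distribution.get)


# Hypothetical history of encounters on a "weight" dimension.
observations = (
    [("heavy", "house")] * 6
    + [("heavy", "cup")] * 2
    + [("light", "cup")] * 5
    + [("light", "peach")] * 3
)

dists = cue_distributions(observations)
print(dists["heavy"])                     # proportions for the cue value "heavy"
print(most_likely_class(dists["light"]))  # class most often paired with "light"
```

In this illustration the proportions for each cue value sum to 1.00 over classes, a property that the inference model relies upon.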


Nature of the Subject 


Cues. Basically the definition of cues must 
rely on psychophysically defined units 
along dimensions of physical stimulation;
however, it is obvious that with experience 
“higher order” dimensions are developed. 
For example, numbers, while abstract, con- 
vey a great deal of information about Os 
with which they are associated. Experience 
not only leads to higher order cue dimen- 
sions but, in addition, individual differences 
in experiential background may produce 
individual differences in the dimensions to 
which Ss attend and individual differences 
in the size of the units on the dimensions. 
Studies by Beach and Wertheimer (1961);
Brunswik (1956); Goodnow (1954); Hays
(1958); Osgood, Suci and Tannenbaum
(1957); Signell (1961); Wertheimer (1960);
and others demonstrate these individual
differences. While these variables must be 
examined when research is done, for sim- 
plicity of discussion it is assumed momen- 
tarily that Ss are attending to all dimensions 
and that the sizes of the cue units on 
these dimensions are the same for all Ss. 

Classes. The cue distributions have a 
subjective counterpart resulting from S’s 
experience with Os and their classes. It 
is assumed that during this experience S 
compiles the relative frequencies of class 
occurrence for the values on the cue dimen- 
sion and, as a result, knows the distribution 
of cue-class associations for each cue value 
on each cue dimension in the environment. 
These subjective counterparts of the en- 
vironmental distributions are called Ss’ 
distribution hypotheses (Brunswik, 1956). 

The assumption of distribution hypothe- 
ses regards S as an actuary (Meehl, 1954; 
Sarbin, 1943). And, just as an actuary’s 
tables become more accurate with an in- 
creased sample size, so too do distribution 
hypotheses become more like their objective 
counterparts as S’s experience with various 
cues and various classes of Os increases. 
With increased experience, S’s distribution 
hypotheses should become highly accurate 
representations of the corresponding en- 
vironmental cue distributions. This point 
will assume a great deal of methodological 
importance in the experiments to be re- 
ported. 

In addition to the differences in atten-
tion to cues, it should be noted that Ss’
distribution hypotheses may not conform to
their objective counterparts. While little is
known about the determinants of the dis-
crepancies it is likely that as an S’s “sample
size” increases, his distribution hypotheses
approach accuracy (see Attneave, 1953;
Edwards, 1961; Preston & Baratta, 1948,
for related research). For the sake of
simplicity in the following discussion it is
assumed that Ss’ various distribution hy-
potheses are accurate.


Nature of the Inference Process 


When an O is encountered in the en- 
vironment, the cue values associated with 
it serve as the basis for a decision about 
its class membership. To make this decision 
S must rely upon his distribution hypoth-
eses. Due to the fact that any cue value has
usually been associated with more than 
one class of Os, the cue does not allow 
an unequivocal decision about the class 
membership of the O which possesses it. 
However, for each cue value associated with 
O, the related distribution hypothesis tells 
S the relative frequency with which the cue 
previously has been associated with the 
members of each class. The O should be 
assigned to the class for which the relative 
frequency is highest. Of course, while one 
cue may indicate a specific class to be the 
most likely alternative for O’s membership, 
other cues may indicate other classes to be 
most likely.

When an O’s various cues do not yield an 
unequivocal inference, the best that S 
can do is to gather together the information 
provided by each of the cues and to arrive 
at a compromise which will allow him 
to determine which class is the “best bet” 
(Brunswik, 1943) for O’s class membership. 
This best bet should be S’s inference for 
that O. The following discussion describes 
a simple method for using the probabilistic 
aspects of cues to arrive at the best bet 
for an O’s class membership and for pre- 
dicting Ss’ inferences. 

Inference Model. The probabilistic re- 
lationships between cues and classes are 
the key to the use of an O’s cue values 
to determine the best bet for its class mem- 
bership. When an O is encountered in the 
environment, some of its cue values are 
immediately known. However, O’s cue value 
on each dimension could be associated with
a member of any of a number of different 
classes. As a result, the O possessing the cue 
values could be a member of any one of 
these classes. Therefore, it is necessary to 
compile, for each class, the evidence which 
indicates that O belongs to that class. Then 
the evidence for each class must be con- 
sidered, relative to the evidence for each of 
the other classes, and a decision made 
about which class is the best bet for O’s 
membership. Finally, O should be relegated 
to the class which was determined to be the 
best bet.

Equation 1 describes a simple way to 
“compile the evidence” for each class. 


\[
\sum_{d=1}^{n} P(k/d) = P(k/1) + P(k/2) + P(k/3) + \cdots + P(k/n) \tag{1}
\]


Where,

k = a specific class under consideration
as a possibility for O’s membership; k =
a through z

d = a dimension on which O’s cue value 
is known; d = 1 through n 

P(k/d) is the relative frequency with 
which O’s cue value on each cue dimen- 
sion, d, has been associated with members 
of a given class, k. 

Equation 2 describes a way to compare 
the “relative amounts of evidence” for the 
different classes. 


\[
E(k) = \frac{\sum_{d=1}^{n} P(k/d)}{\sum_{k=a}^{z} \sum_{d=1}^{n} P(k/d)} \tag{2}
\]


Where,

$\sum_{d=1}^{n} P(k/d)$ is the value for one class, k,
derived from Equation 1 and

$\sum_{k=a}^{z} \sum_{d=1}^{n} P(k/d)$ is the sum of these
values over all classes, a through z.

The denominator of Equation 2 can be 
simplified. The proportion of times a cue has 
been associated with each class sums to 1.00 
over all classes; a cue never occurs without 
an O, and Os always belong to classes. 
Therefore, the denominator of Equation 
2 becomes, 


\[
\sum_{k=a}^{z} \sum_{d=1}^{n} P(k/d) = 1.00 \times n = n.
\]


Where, 
n is the total number of cue dimensions 
for which O’s cue values are known. 
Replacing the denominator in Equation 
2 with n yields Equation 3. 


\[
E(k) = \frac{\sum_{d=1}^{n} P(k/d)}{n} \tag{3}
\]


Equation 3 gives the proportion of the 
total evidence which indicates that O is 
a member of Class k. This, E(k), can be 
regarded as the degree to which a given class 
can be expected to be correct in light of 
O’s immediately apparent cue values. The 
best bet for O’s class membership is the 
class having the highest E(k) as given 
by Equation 3. 
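The computation in Equations 1 through 3 can be sketched as follows. This is a minimal illustration, not the procedure used to generate the reported predictions; the function names and the three sets of P(k/d) values are hypothetical. The example is chosen so that the cues disagree about the most likely class, which is the situation the compromise is meant to resolve.

```python
def expected_correctness(p_by_dimension):
    """Equation 3: E(k) = (1/n) * sum over d of P(k/d).

    `p_by_dimension` holds one dict per cue dimension on which O's cue
    value is known; each dict maps a class name to P(k/d) for that value.
    """
    n = len(p_by_dimension)
    classes = set().union(*p_by_dimension)
    return {
        k: sum(p.get(k, 0.0) for p in p_by_dimension) / n
        for k in classes
    }


def best_bet(e_by_class):
    """Return the class having the highest E(k)."""
    return max(e_by_class, key=e_by_class.get)


# Hypothetical P(k/d) values for an O with three known cue values.
p_values = [
    {"red": 0.60, "yellow": 0.30, "blue": 0.10},  # dimension 1 favors red
    {"red": 0.20, "yellow": 0.70, "blue": 0.10},  # dimension 2 favors yellow
    {"red": 0.50, "yellow": 0.25, "blue": 0.25},  # dimension 3 favors red
]

e = expected_correctness(p_values)
print(e)
print(best_bet(e))
```

Because each dimension's proportions sum to 1.00 over classes, the E(k) values also sum to 1.00, in agreement with the simplification of the denominator of Equation 2.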


METHOD


Theoretically, Ss’ inferences about Os are predi- 
cated upon the statistical structure of the environ- 
ment. Therefore to test the utility of the model an 
experimental situation was constructed which had 
controllable statistical characteristics and then, be- 
cause distribution hypotheses are assumed to de- 
velop with experience, Ss were allowed experience 
with the “experimental environment” sufficient to 
develop accurate distribution hypotheses. Finally, 
Ss’ inferences for specific Os were compared to the 
inferences which were predicted by the model. 

To compare Ss’ inferences with those predicted 
by the model, data were obtained from two 
sources: (a) inferences made for specially designed 
test Os and (b) inferences made for each of the 
Os which comprised the experimental environ-
ment. The latter are called the environmental Os.


Test Os 


The use of test Os required that Ss first be 
given enough experience in the experimental en- 
vironment to develop accurate distribution hy- 
potheses. Then test Os were presented, and the 
model’s decision about the best bet for each test 
O’s class was compared with the Ss’ inferences for 
the test O. This procedure allowed an estimation 
of the utility of the model for predicting Ss’ infer- 
ences about an O which has never before been en- 
countered. 


Environmental Os 


Inferential Accuracy. Our definition of a class 
as a covert attribute about which S does not know 
provides a criterion for designating an inference 
as either accurate or inaccurate. An inference is 
inaccurate when an O is assigned to a class which, 
in the light of later information, proves to be in-
correct.

Inferential accuracy was an important tool for 
the investigation of Ss’ inferences. To evaluate the 
utility of the model for predicting inferences for 
test Os, it was sufficient to compare Ss’ inferences 
for each test O with the theoretical best bet for 
that test O; Ss either chose the same class as the 
model or they did not. For the environmental Os, 
however, it was possible to examine Ss’ inferences
for Os repeatedly encountered in the experimental
environment. Here the course of Ss’ increase in
inferential accuracy could be examined in light of 
our assumptions about the inference process. It 
was predicted that in the early stages of training 
the accuracy for O would be low; however, as ex- 
perience increased and the distribution hypotheses 
became more accurate, the accuracy of inferences 
for the given O would increase. 

At first glance it appears as though an increase 
in inferential accuracy could be accounted for 
merely by paired-associates learning of Os and 
their classes. However, by using the model it was 
possible to specify Os for which accuracy should 
not increase, as well as the errors which should be 
made for these Os. If there were no differences be-
tween Ss’ inferential accuracy for these Os and
their accuracy for other Os, the present theory could be re-
garded as incorrect. If, however, Ss’ inferential
accuracy for these Os differed from the accuracy 
for other Os, it would constitute strong support 
for the validity of the present approach to infer- 
ence behavior. 

Confirming and Infirming Cases. When the 
model was used to infer the class membership of 
an O and the inferred class was the one to which it
actually belonged, the O was called a confirming case.
On the other hand, some Os’ cue values lie in the 
tails of the cue distributions. Applying the model 
to these Os resulted in designating an incorrect 
class as the best bet for their class membership. 
These Os were called infirming cases. 
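The distinction can be stated compactly in code. The sketch below assumes that E(k) has already been computed for an O; the function name `classify_case` and the E(k) values are hypothetical.

```python
def classify_case(e_by_class, actual_class):
    """Label an O as a confirming or an infirming case.

    The O is confirming when the class with the highest E(k) is the class
    to which it actually belongs, and infirming otherwise. Ties are broken
    by dictionary order, which is an assumption of this sketch.
    """
    model_best_bet = max(e_by_class, key=e_by_class.get)
    kind = "confirming" if model_best_bet == actual_class else "infirming"
    return kind, model_best_bet


# Hypothetical E(k) values shared by two Os that differ in actual class.
e_values = {"red": 0.55, "yellow": 0.30, "blue": 0.15}

print(classify_case(e_values, "red"))     # model and environment agree
print(classify_case(e_values, "yellow"))  # model's best bet is incorrect
```

For the second O the model's error is specific: the O is expected to be called red, not merely to be called something other than yellow.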

It was predicted that if the model is an accu- 
rate analogy to Ss’ inference behavior, inferential 
accuracy for confirming cases should increase as 
experience in the environment increases, i.e., as
distribution hypotheses become accurate. On the 
other hand, Ss’ inferences for infirming cases should 
follow the errors of the model. These cases should 
be assigned repeatedly to the class which the 
model dictates to be the best bet even though it 
is incorrect. Thus, inferential accuracy for infirming 
cases should decrease as Ss’ distribution hypotheses 
become accurate, and the class to which they are 
assigned should be the model-determined best bet. 
In short, Ss should make the same inferences as 
are made by the model. 

In summary, if the theory is correct and the 
model is adequate, it was predicted (a) that Ss 
would learn to use the probabilistic aspects of the 
cues associated with Os in order to make accurate 
inferences about Os’ class memberships; (b) that 
for test Os Ss’ inferences would correspond to the 
class designated as the best bet by the model; (c) 
that for environmental Os which were identified
by the model as confirming cases Ss’ inferential
accuracy would increase as experience in the ex- 
perimental environment increased; (d) that for 
environmental Os which were identified by the 
model as infirming cases Ss’ inferential accuracy 
would decrease as experience in the experimental 
environment increased; (e) that Ss’ inference er- 
rors for infirming cases would result from assigning 
these Os to the class corresponding to the class 
designated as the best bet by the model. 


EXPERIMENT I


Construction of the Experimental Environ- 
ment 


The experimental environment consisted of a 
deck of 3 X 5 inch cards, each of which was con- 
sidered an O. On its face each card possessed one 
value from each of three cue dimensions: (a) 
letter (Values a through i), (b) number (Values 1
through 9), and (c) pointer position (Values 1 
through 9 in a clockwise direction). The class to 
which each card belonged—red, yellow, or blue— 
was written on its back. 

In order to specify the sets of cues and the 
class for each card, arbitrary frequency distribu- 
tions were defined for each class on each cue di- 
mension. These distributions, presented in Table 
1, were then used as the marginals on three 9 X 
9 X 9 matrices, one matrix for each of the three 
classes. The marginal distributions determined fre- 
quencies in the matrix cells, and each matrix cell 
represented a combination of three cue values. 
Next, cards bearing each cell’s cue combination 
were made; the number of cards made was dic- 
tated by the frequency (never larger than six) in 
the particular cell, and the color on their backs 
was dictated by the matrix to which the cell be- 
longed. This procedure yielded a deck of 120 cards, 
40 in each class. 

There were 62 different cue combinations. (In 
the remainder of the discussion, individual cards 
will be referred to according to their cue combina-
tions. For example, f26 means a card possessing
cue value f from the letter dimension, cue value 2
from the number dimension and the cue value 6
from the pointer dimension. There were four cards
bearing the cues f26 in the deck.) All three cue
dimensions had approximately equal correlations
with the classes (contingency coefficients of .51
for letter, .51 for number, .40 for pointer).

After the deck was constructed P(k/d) was com- 
puted for each value on each dimension for each 
class. These P(k/d), presented in Table 1, were
then used in the application of the model to each 
card in the deck in order to obtain the theoretical 
estimate of the class which was the best bet for 
the card. Ordinarily the class with the highest 
value of E(k) was the class to which the card ac- 
tually belonged; when this occurred, the card was 
designated a confirming card. For some cards, 
however, the model yielded values of E(k) which
indicated that the best bet for their class member-
ship was a class other than the one to which they
actually belonged. These cards were designated in-
firming cards.3 This procedure resulted in 53 dif-
ferent kinds of confirming cards and 9 different
kinds of infirming cards.

3 It should be understood that infirming cases
were not violations of the statistical properties of
the experimental environment. Instead, they were,
in a sense, errors made by the model. An infirming
case appears a certain proportion of the time rela-
tive to the other Os in the environment, and its
repeated occurrence does not lead to an increase in
the relative frequency of its values in the cue
distribution for its class. The values used in the
model are the relative frequencies from these dis-
tributions and, therefore, each time the infirming
case appears, the same relative frequencies are used
in the computations. In this experimental situa-
tion, given the same cue values, the model will al-
ways assign the infirming case to the incorrect
class.


TABLE 1

FREQUENCY DISTRIBUTIONS FOR CUE VALUES’ ASSOCIATION WITH EACH CLASS AND THE CORRESPONDING
VALUES OF P(k/d): EXPERIMENT I

Class                 Cue value

Letter                a     b     c     d     e     f     g     h     i
  Red      f          0     0     0     0     4     6    10    14     6
           P(k/d)   .00   .00   .00   .00   .20   .43   .71   .93  1.00
  Yellow   f          0     1     4     8    14     8     4     1     0
           P(k/d)   .00   .07   .29   .57   .70   .57   .29   .07   .00
  Blue     f          8    14    10     6     2     0     0     0     0
           P(k/d)  1.00   .93   .71   .43   .10   .00   .00   .00   .00

Number                1     2     3     4     5     6     7     8     9
  Red      f          0     1     4     8    14     8     4     1     0
           P(k/d)   .00   .07   .29   .57   .74   .53   .31   .08   .00
  Yellow   f          4    14    10     6     4     2     0     0     0
           P(k/d)  1.00   .93   .71   .43   .21   .13   .00   .00   .00
  Blue     f          0     0     0     0     1     5     9    11    14
           P(k/d)   .00   .00   .00   .00   .05   .33   .69   .92  1.00

Pointer               1     2     3     4     5     6     7     8     9
  Red      f          4    14    10     6     4     2     0     0     0
           P(k/d)  1.00   .93   .71   .43   .18   .12   .00   .00   .00
  Yellow   f          0     0     0     0     4     6    10    14     6
           P(k/d)   .00   .00   .00   .00   .18   .38   .71   .93  1.00
  Blue     f          0     1     4     8    14     8     4     1     0
           P(k/d)   .00   .07   .29   .57   .64   .50   .29   .07   .00
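The application of the model to a card can be sketched from the frequencies in Table 1. The code below is an illustration only: the names `FREQ`, `p_k_given_d`, and `e_k` are hypothetical, the blue frequency for letter e is taken as 2 (the value implied by the tabled proportions .20, .70, and .10 for that letter), and a dash in a card label is assumed to mark a cue that is absent.

```python
CLASSES = ("red", "yellow", "blue")
LETTERS = "abcdefghi"

# Frequencies f from Table 1, ordered by cue value (a-i or 1-9).
FREQ = {
    "letter": {
        "red":    [0, 0, 0, 0, 4, 6, 10, 14, 6],
        "yellow": [0, 1, 4, 8, 14, 8, 4, 1, 0],
        "blue":   [8, 14, 10, 6, 2, 0, 0, 0, 0],
    },
    "number": {
        "red":    [0, 1, 4, 8, 14, 8, 4, 1, 0],
        "yellow": [4, 14, 10, 6, 4, 2, 0, 0, 0],
        "blue":   [0, 0, 0, 0, 1, 5, 9, 11, 14],
    },
    "pointer": {
        "red":    [4, 14, 10, 6, 4, 2, 0, 0, 0],
        "yellow": [0, 0, 0, 0, 4, 6, 10, 14, 6],
        "blue":   [0, 1, 4, 8, 14, 8, 4, 1, 0],
    },
}


def p_k_given_d(dimension, index):
    """Relative frequency of each class for one cue value on one dimension."""
    column = {k: FREQ[dimension][k][index] for k in CLASSES}
    total = sum(column.values())
    return {k: column[k] / total for k in CLASSES}


def e_k(card):
    """Equation 3 for a card label such as 'f26'; '-' marks an absent cue."""
    letter, number, pointer = card
    known = []
    if letter != "-":
        known.append(("letter", LETTERS.index(letter)))
    if number != "-":
        known.append(("number", int(number) - 1))
    if pointer != "-":
        known.append(("pointer", int(pointer) - 1))
    n = len(known)
    return {k: sum(p_k_given_d(d, i)[k] for d, i in known) / n for k in CLASSES}


e = e_k("f26")
print({k: round(v, 2) for k, v in e.items()})
print(max(e, key=e.get))  # the model's best bet for card f26
```

Whether a card is confirming or infirming then depends on comparing this best bet with the class written on the card's back, which the sketch does not attempt to reproduce.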

In addition to the deck of cards, a number of 
test cards (test Os) was made. There were 4 test 
cards having three cue values on them and 10 hav-
ing two cue values on them. The combination of
cues on these cards was unlike any of the combina- 
tions on the cards in the deck. Three blank cards 
were also included among the test cards. 


Procedure 


The Ss were 40 undergraduate university stu- 
dents who met as a group during the time they 
were gaining experience with the experimental en- 
vironment. As will be explained below, only 10 Ss 
were used in the data analysis. On the first day it 
was explained to Ss that by attending to the cues 
on the front of the cards they could come to make 
accurate inferences about the class to which the
card belonged. They were further told, “each card 
is an individual object and is to be thought of and 
treated as such.” The procedure was then ex- 
plained, and Ss were told to look at the cues on 
the face of the card, make a decision about the 
card’s class, and write an inference on the answer 
sheet. After a 4-5 second delay the experimenter 
(E) would reveal the correct class for that card
and present the next card. If the inference was 
correct, Ss were instructed to place a check beside 
it on the answer sheet. This was done to increase 
Ss’ involvement in the task. 

Twelve identical decks of the 120 cards were 
each shuffled and placed one after another in a 
box. Although Ss were unaware of where one deck 
left off and another began, they were shown the 
first 2 of the 12 decks in the first training session. 
Decks 3, 4, and 5 were seen in the second session;
Decks 6, 7, and 8 in the third session; and Decks
9, 10, and 11 in the fourth session. Deck 12 was 
seen later, in a fifth session. Each time a full deck 
was seen will be referred to as a trial. For the fifth 
training session, Ss met individually with E and
were shown the twelfth deck (Trial 12). Then, S 
was shown the test cards and was asked to make 
a decision about their class membership. 


Results 


Cue Dimension Use. In order for the 
computations for the model to be applicable 
in predicting Ss’ responses, it was necessary 
to determine which Ss were relying on all 
three dimensions. To do this it was assumed 
that Ss’ distribution hypotheses would be 
most accurate for the dimensions to which 
they attended. Therefore, the group was 
shown a blank test card and asked what 
cue value from each of the dimensions 
would be most likely to appear on the card’s 
face if it were known that the card was, 
say, a member of the red class. Then, for 
each S, the absolute difference was obtained 
between his judgment and the actual mode 
of the related cue distribution. When the 
mean of these differences for one of the cue 
dimensions was at least twice as large as the 
mean of the differences for the other two 
dimensions, it was assumed that S was not 
attending to the dimension. Only the 10 
Ss found to be using all three dimensions 
were used in the data analysis.4
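The screening rule can be sketched as follows. The modes are read from the frequencies in Table 1, with letters coded by position (a = 1 through i = 9); the judgments are hypothetical, and the sketch assumes that "the other two dimensions" means the average of the two remaining dimension means, which the text does not spell out.

```python
from statistics import mean

# Modal cue value for each class on each dimension (from Table 1).
ACTUAL_MODES = {
    "letter":  {"red": 8, "yellow": 5, "blue": 2},
    "number":  {"red": 5, "yellow": 2, "blue": 9},
    "pointer": {"red": 2, "yellow": 8, "blue": 5},
}


def unattended_dimensions(judged_modes, actual_modes, ratio=2.0):
    """Return dimensions whose mean absolute error is at least `ratio`
    times the mean error on the other dimensions."""
    mean_diff = {
        d: mean(abs(judged_modes[d][k] - actual_modes[d][k])
                for k in actual_modes[d])
        for d in actual_modes
    }
    flagged = []
    for d, value in mean_diff.items():
        others = [v for other, v in mean_diff.items() if other != d]
        # value > 0 guards against flagging a perfectly judged dimension.
        if value > 0 and value >= ratio * mean(others):
            flagged.append(d)
    return flagged


# Hypothetical judgments by one S of the most likely cue value per class.
judged = {
    "letter":  {"red": 8, "yellow": 5, "blue": 3},
    "number":  {"red": 2, "yellow": 6, "blue": 5},
    "pointer": {"red": 2, "yellow": 7, "blue": 5},
}

print(unattended_dimensions(judged, ACTUAL_MODES))
```

An S for whom the returned list is empty would be treated as using all three dimensions under this reading of the criterion.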


4 Because it may appear that the results to be
reported are due to selection of Ss it should be
noted that a similar analysis for the 20 Ss who were
using only the letter and pointer dimensions yielded
virtually identical results. The two-cue analysis re-
quired deletion of the number dimension from the
computations for every E(k) in the experiment.
Because the experiment was originally designed for
three-cue dimensions the two-cue theoretical pre-
dictions were frequently not as clear-cut as the
three-cue predictions. Therefore, to avoid needless
repetition and complexity in presentation the two-
cue results are not presented here.


Inferential Accuracy. The first prediction
was that Ss would be able to use the
probabilistic nature of cues to make ac-
curate inferences about Os. The curve in
Figure 1 shows the mean number of in-
ferential errors for each of the 12 trials
and clearly shows an increase in inferential
accuracy accompanying an increase in ex-
perience. The decrease in errors as a func-
tion of experience in the experimental en-
vironment (the training deck), together
with the rest of the data to be presented,
are taken as support for the prediction.

Fig. 1. Mean number of errors made on each trial by 10 Ss: Experiment I.

Test Cards. The second prediction was
that Ss’ inferences for the class member-
ship of test cards would correspond to the
best bet as dictated by the model. The
results for this prediction are shown in
Table 2.

This table is divided into 14 horizontal
sections, one for each test card. In each
section there are 5 columns. The first column
contains the cues which were on the test
card. (A dash signifies the absent cue on
the test cards which have only two cue
values.) The second column indicates the
three classes, and the third contains the
values of E(k) corresponding to each class.
The class and value of E(k) set in boldface
type in the second and third columns desig-
nate the best bet on the basis of the model.
The fourth column presents the number of
Ss who gave each class as their inference for
the test card. The last column gives the
value and significance for a one-sample
chi-square test for the results in the fourth
column.

The response distributions for 10 of the
14 test cards differ significantly from chance.
For all of these 10 the greatest number of
Ss chose the class which the model dictated
to be the best bet. Of the four test cards
yielding insignificant response distributions
one, c-6, is in the predicted direction and
another, d54, shows a split response dis-
tribution corresponding closely to the nearly
equal values of E(k) for classes red and
blue. These results support the hypothesis
that Ss’ inferences for test cards correspond
to the best bets yielded by the model.
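The one-sample chi-square test applied to each test card's response distribution can be sketched as follows. The sketch assumes that "chance" means equal expected frequencies for the three classes, which the text does not state explicitly, and the response counts are hypothetical; they are not the values of Table 2.

```python
def chi_square_one_sample(observed):
    """Chi-square for observed counts against equal expected frequencies."""
    total = sum(observed)
    expected = total / len(observed)
    return sum((o - expected) ** 2 / expected for o in observed)


# Critical value of chi-square for df = 2 at the .05 level.
CRITICAL_05_DF2 = 5.991

# Hypothetical responses of 10 Ss to one test card: red, yellow, blue.
counts = [8, 1, 1]

stat = chi_square_one_sample(counts)
print(round(stat, 2))
print(stat > CRITICAL_05_DF2)  # True when the distribution departs from chance
```

A nearly even split such as 4, 3, 3 falls well short of the critical value, which corresponds to the kind of response distribution reported as insignificant.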

Confirming and Infirming Cards. The 
third prediction was that Ss’ inferential 
accuracy for confirming cases should in- 
crease with increased experience with the 
experimental environment and the fourth 
was that there would be a decrease in 
inferential accuracy for infirming cases. 

In order to evaluate the accuracy of
these two predictions the proportion of
Ss giving each of the three response alterna-
tives was computed for each card for each
trial.5 The cards were then divided into
confirming and infirming cards, and the
mean proportion of Ss giving each response
alternative was found for the two kinds
of cards. These are plotted in Figures 2,
3, and 4.

5 The data analysis dealt only with the propor-
tions yielded for the first time a card appeared on
a trial. For example, there are six d37 cards in the
deck; the analysis utilizes the results for the first
d37 to appear on each trial.

In Figure 2 the mean proportion of Ss
giving the correct class for the confirming
cards is compared with the mean propor-
tion of Ss giving the correct class for in-
firming cards on each of the 12 trials. These
results support the third prediction: infer-
ential accuracy for confirming cards clearly
increases as Ss’ experience with the ex-
perimental environment increases. The
fourth experimental prediction is contra-
dicted by the results in Figure 2; the in-
ferential accuracy for infirming cards in-
creases with experience in the experimental
environment.
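The proportions plotted in the figures can be illustrated with a short sketch for a single trial. The function names and the responses are hypothetical; each inner list stands for the inferences given by the Ss to the first appearance of one card on the trial.

```python
from collections import Counter

CLASSES = ("red", "yellow", "blue")


def response_proportions(responses):
    """Proportion of Ss giving each class for one card on one trial."""
    counts = Counter(responses)
    n = len(responses)
    return {k: counts.get(k, 0) / n for k in CLASSES}


def mean_proportions(cards):
    """Mean, over cards of one kind, of the per-card response proportions."""
    per_card = [response_proportions(r) for r in cards]
    return {k: sum(p[k] for p in per_card) / len(per_card) for k in CLASSES}


# Hypothetical responses of four Ss to two confirming cards on one trial.
confirming_cards = [
    ["red", "red", "yellow", "red"],
    ["red", "blue", "red", "red"],
]

print(mean_proportions(confirming_cards))
```

Repeating the computation for each of the 12 trials, separately for confirming and infirming cards, would yield one curve per response alternative.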

The fifth prediction was that errors on 
the infirming cards would result from as- 
signing those cards to the class which the 
model dictates as the best bet, rather than 
to the correct class. The results for this pre- 
diction are given by a comparison of Figures 
3 and 4. For the confirming cards the 
curves in Figure 3 represent the mean 
proportion of Ss giving the correct response 
(which is also the theoretical best bet) and 
the other two responses for each trial. The 
top curve indicates the correct response, and 
the bottom two curves indicate the two 
incorrect alternatives. For the infirming 
cards the curve in Figure 4 which starts 
in the middle and ends high is the mean 
proportion of Ss who gave the correct re- 
sponse (which is not the theoretical best 
bet) on each of the 12 trials. The curve 
which starts high and ends in the middle 
is the mean proportion of Ss who gave 
as their inference the alternative which, 
while incorrect, was the theoretical best
bet. The curve that starts low and ends
lower is the mean proportion of Ss who 
on each trial gave as their inference for 
the infirming cards the alternative which 
was neither correct nor the theoretical 
best bet. The fifth prediction is supported 
by these data; for the most part, errors 
for infirming cards resulted from assigning 
the cards to the class which was the theo- 
retical best bet rather than to the correct 
class. 


Discussion 


These results indicate that Ss learned to 
utilize probabilistic cues to make accurate 
inferences for Os (see Figure 1). Moreover,
by using the model it was possible to make 
fairly accurate predictions of Ss’ responses 
for test Os which they had never seen before 
(see Table 2). For those Os which the model 
designated to be confirming cases Ss quickly 
came to give the correct class, and the 
alternative inferences dropped out. How- 
ever, the results for the infirming cases were 
not so clear-cut. For these cases Ss tended 
to give the theoretical best bet in the early 
trials, but later they tended to give the 
correct—low E(k) value—response. This 
result contradicts the hypothesis that in- 
ferential accuracy would decrease for in- 
firming cases. However, the infirming card 
results support the fifth hypothesis; the 
incorrect responses for infirming cases are 
usually the theoretical best bet as de- 
termined by the model. 

While the results of the experiment in- 
dicated that the theoretical approach out- 
lined in the introduction is a promising way 
of viewing inference behavior, the small 
number of classes (three) and the small 
number of cue values (nine) on each cue 
dimension place restraint upon acceptance 
of the results. It seemed possible that 
these results might not be obtained a second 
time if more classes and longer cue dimen- 
sions were used. And, of course, the major 
question is how the fourth prediction, that 
there would be a decrease in inferential 
accuracy over trials for infirming cards, 
would fare in a replication of the experi- 
ment. 

Furthermore, the small number of Ss 
and the way in which the test cards were

Fig. 2. Mean proportion of Ss giving correct class as their responses: Experiment I.
Fig. 4. Infirming card inference results for Experiment I.


presented to Ss also place limitations upon 
the acceptance of the results of the study. 
In this experiment Ss were shown the test 
cards in a separate session following the 


learning trials; for some Ss this session was 
almost 48 hours after the final training trial. 
This delay may have influenced the results 
on the test cards. An alternative to this pro-
cedure would be to place the test cards in
the training deck on the last trial so that Ss 
would make inferences about them in the 
course of making inferences about the deck 
cards. 

Because of these limitations in experi- 
mental procedure and because a replication 
of the obtained results would be strong 
support for the utility of the present theo- 
retical approach, another experiment was 
conducted. 


EXPERIMENT II 


Construction of the Experimental Environ- 
ment 


The research strategy used in the first experi- 
ment also guided the procedure used in this one. 
The deck was larger in that it consisted of 200, 
rather than 120 cards. Each card belonged to one 
of five different classes (red, yellow, blue, green, 
or orange) and there were 40 cards in each class. 
The cue dimensions were expanded to 12 values: 
A through L on the letter dimension, 1 through 12
on the number dimension, and Positions 1 through 
12 (in a clockwise direction) on the pointer dimen- 
sion. The method used in the first experiment to 
select the cue combination for each card and the 
number of each kind of card in the deck was also 
used for this experiment. (Contingency coeffi- 
cients for the relationship between each cue di- 
mension and the classes were .76 for letter, .70 for 
number, and .75 for pointer, indicating that no 
one of the dimensions was a better predictor of 
the cards’ classes than the other dimensions.) The 
matrix marginal distributions for each class and 
the values of P(k/d) for this deck of cards are 
presented in Table 3. This deck was not an exten- 
sion of the one used in the first experiment. The 
method of construction is the same, but the deck 
and the cue distributions were completely new. 
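The contingency coefficients quoted above can be illustrated with a short computation. The sketch below assumes that Pearson's contingency coefficient, C = sqrt(chi-square / (chi-square + N)), is the statistic intended; the report does not say so explicitly. The two tables are hypothetical and are not taken from the experimental deck.

```python
from math import sqrt

def contingency_coefficient(table):
    """Pearson's C for a table of counts (rows = cue values, columns = classes)."""
    n = sum(sum(row) for row in table)
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            if expected > 0:
                chi2 += (observed - expected) ** 2 / expected
    return sqrt(chi2 / (chi2 + n))

# Hypothetical tables: a cue that determines the class, and a cue unrelated to it.
perfect_cue = [[10, 0, 0], [0, 10, 0], [0, 0, 10]]
useless_cue = [[5, 5, 5], [5, 5, 5], [5, 5, 5]]

print(contingency_coefficient(perfect_cue))
print(contingency_coefficient(useless_cue))
```

Under this assumption C cannot reach 1.00; for a square table with r rows its ceiling is sqrt((r - 1) / r), so coefficients from tables of different sizes are not directly comparable.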

Application of the model to the deck of cards 
yielded 78 different kinds of cards which were con- 
firming cards and 14 which were infirming cards. In 
addition to the deck cards 15 test cards were con- 
structed, 12 of which had three cue values and 
3 of which were blank. 
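The distinction between confirming and infirming cards can be stated compactly. In the minimal sketch below the E(k) values are taken as given (their derivation from P(k/d) belongs to the model and is not reproduced here); the function names and the example values are hypothetical.

```python
def best_bet(e_values):
    """Return the class with the highest E(k); e_values maps class -> E(k)."""
    return max(e_values, key=e_values.get)

def card_kind(e_values, correct_class):
    """A card is confirming when its correct class is also the best bet."""
    return "confirming" if best_bet(e_values) == correct_class else "infirming"

# Hypothetical E(k) values for one kind of card.
example_e = {"red": 0.60, "yellow": 0.30, "blue": 0.10}

print(card_kind(example_e, "red"))     # correct class is the best bet
print(card_kind(example_e, "yellow"))  # correct class is not the best bet
```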


The Ss 


The Ss were United States Naval Aviation 
Cadets of approximately the same age and intelli- 
gence as Ss used in the first experiment. Because 
part of the group of Ss transferred before the ex- 
periment was completed, the data are reported 
for two groups. Group A had 13 training sessions, 
and Group B had only 8 sessions. The Ss using all 
three cue dimensions were selected in the same 
way as in Experiment I. This resulted in the use 
of data from 16 out of 39 Ss in Group A and 20 
out of 41 Ss in Group B. 


Procedure 


The procedure was the same as for Experiment 
I except that the test cards were placed randomly 
in the training deck on the last trial rather than 
being presented at a special session. For this last 
training trial Ss were told that they would not be 
given the correct answer for the cards. The E
merely presented each card, paused for Ss to make 
their inferences, then presented the next card. The 


test cards were presented as they came up in the
deck.

Results 


Inferential Accuracy. As in the previous 
experiment the first prediction was that Ss 
could learn to use the probabilistic cues to 
make accurate inferences. The curves in 
Figure 5 represent the mean number of 
errors made by Ss on each trial, for both 
Groups A and B. Because of the expanded 
deck (200 cards) and the greater number 
of classes, learning in the second experi- 
ment appears to be more difficult than in the 
first. The error curves in Figure 5 start 
higher and end higher than the comparable 
curves for the first experiment (see Figure 
1). In general, however, these results in- 
dicate that Ss’ inferences become increas- 
ingly accurate as training progresses. This 
decrease in errors with training is essentially 
the same as the corresponding results of the 
first experiment and, together with the 
other results reported here, supports the 
first prediction. 

Test Cards. The results for the second 
prediction, that Ss’ inferences for test cards 
would correspond to the class designated 
the best bet by the model, are presented 
in Table 4. The arrangement of the table 
is similar to the one presented for the first 
experiment except that the response dis- 
tributions and the chi-square results are 
presented for both Groups A and B. 

For Group A 9 of the 12 response dis- 
tributions differ significantly from chance. 
Of these 9 significant distributions 6 con-
form to the prediction. This leaves 3 test
cards with distributions which are signifi- 
cantly different from chance, but do not 
conform to the predictions based on the 
model. In short, Ss selected the theoretical 
best bet on only 6 of the 12 test cards. 
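The comparison of a response distribution with chance can be sketched as a chi-square goodness-of-fit test. The sketch assumes that "chance" means equal expected frequencies over the five classes; the exact form of the test used for Table 4 is not given in this section, and the response counts below are hypothetical. With 16 Ss and five classes the expected frequencies are small (3.2), so the approximation is rough.

```python
def chi_square_vs_chance(counts):
    """Chi-square statistic against equal expected frequencies."""
    n = sum(counts)
    expected = n / len(counts)
    return sum((c - expected) ** 2 / expected for c in counts)

# Tabled critical value of chi-square for df = 4 at the .05 level.
CRITICAL_05_DF4 = 9.488

# Hypothetical response counts of 16 Ss over the five classes for two test cards.
peaked_counts = [11, 2, 1, 1, 1]
flat_counts = [4, 3, 3, 3, 3]

for counts in (peaked_counts, flat_counts):
    statistic = chi_square_vs_chance(counts)
    print(counts, round(statistic, 3), statistic > CRITICAL_05_DF4)
```

A significant statistic shows only that responses were not evenly spread; whether the modal response is the theoretical best bet must be checked separately, as the text does.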

For Group B 9 of the 12 response dis- 
tributions are significantly different from 
chance. Of these 9 significant distributions
Fig. 5. Mean number of errors made on each trial by 16 Ss in Group A and 20 Ss
in Group B: Experiment II.


7 conform to the prediction. This leaves 2 
test cards with distributions which are 
significantly different from chance, but 
which do not conform to the predictions 
based on the model. Thus, Ss in Group B 
selected the model-designated best bet on
only 7 of the 12 test cards. At best these 
results only partially confirm the previous 
results and are only equivocal support for 
the prediction that test card inferences 
correspond to the theoretical best bet. 

Confirming and Infirming Cards. The 
learning curves for the confirming and 
for the infirming cards, for both groups of 
Ss, are compared in Figure 6. The difference 
between the curves for the two kinds of 
cards is similar to the corresponding results 
for the first experiment and supports the 
hypothesis that the probabilistic aspect of
the cues is a factor influencing the accu-
racy of Ss’ inferences.

Figures 7 and 8 contain the results for 
the confirming cards for Groups A and B, 
respectively. In these graphs the top curve
represents the (correct) class with the
highest E(k) value, the second curve the
(incorrect) class with the second highest E(k),
the third curve the (incorrect) class with the
third highest E(k), etc. (The slight drop in accuracy


at the end of training occurs during the last 
trial on which Ss were not told the correct 
answers for the cards.) Again, the results 
are highly similar to the results from the 
first experiment and support the prediction 
that Ss’ inferential accuracy for confirming 
cards increases as experience in the experi- 
mental environment increases. 

Figures 9 and 10 contain the results for 
the infirming cards for the two groups, 
respectively. Here the curve which is high- 
est on the last trials is for the correct class; 
this is not, however, the highest E(k) class 
according to the model. The second curve 
is for the incorrect class, which the model 
indicates is the best bet. The third, fourth,
and fifth curves are the (incorrect) classes with
the third highest, fourth highest, and fifth
highest E(k). These results, which are similar
to the results from the first experiment, 
prove the fourth prediction to be wrong. 
The results for all three groups of Ss (the 
students and both groups of cadets) show 
that inferential accuracy did not decrease 
for infirming cards. 

The fifth prediction, that Ss’ errors on 
infirming cards would result from giving 
the best bet rather than the correct answer, 
is supported in Figures 9 and 10. Here too
the results agree with the results of the
first experiment.

Fig. 6. Mean proportion of Ss giving the correct class as their response: Experiment II.
(Upper curve = confirming cards. Lower curve = infirming cards.)
Fig. 9. Infirming card results for Group A: Experiment II.
Fig. 10. Infirming card results for Group B: Experiment II.

Alternative Inference Methods. The results of the second experiment are highly
similar to the results of the first except that
for the second experiment the test card
responses did not conform as closely to the
model as they did in the first experiment.
This lack of predictability for test cards,
along with the consistent failure of the 
data to support Prediction 4, indicates that 
the model does not fully account for Ss’ 
inferences. 

While the results show that the model 
is not wholly correct, they also show that 
it is not wholly wrong. First, the theoretical 
best bet for infirming cards was most fre- 
quently chosen in the early stages of train- 
ing (see Figures 4, 9, and 10), even though 
the correct answer became more frequent 
as training progressed. Second, Predictions 
2 and 4, that Ss’ test card inferences would 
correspond to the model-derived best bet 
and that accuracy would decrease for in- 
firming cards, were only partially incorrect. 
Some of the test card results conformed 
to the theoretical best bet. These results 
indicate that the cue probabilism had an 
influence upon inferences but that it was 
not the whole story. It seemed possible 
that in the later stages of training and, 
under some conditions, for the test cards, 
Ss relinquished use of cue probabilism and 
adopted other methods of making infer- 
ences. Therefore, a detailed analysis of each 
S’s inference records was undertaken to 
investigate the nature of the alternative 
inference methods. 

The examination of the individual re- 
sponse records revealed that Ss’ inferences 
all followed a fairly consistent pattern, a 
pattern which is reflected in the curves 
presented earlier (see Figures 2-4 and 
6-10). In the beginning of training the re- 
sponses were essentially random. Then, for 
confirming cards, inferential accuracy 
started to increase rapidly and continued 
to rise toward perfect accuracy. For in- 
firming cards, however, the pattern was 
different. The infirming card responses 
started with initial randomness and then 
changed to the theoretical best bet (which 
was incorrect). Then Ss stopped giving the 
best bet as their response and started to 
give the correct (lower E(k)) answers for
the infirming cards. 

These results permit the following hy- 
pothesis. In the early stages of training Ss 
began to learn about the probabilistic 
aspects of the cues (until these were learned 


the responses were random), then, as their 
distribution hypotheses developed Ss began 
to give the best bet as their inferences. 
However, as training progressed and the 
deck cards were seen repeatedly, Ss became 
familiar with the individual cards and their 
component cues. As this happened, Ss
could increasingly rely upon recognition of
individual cards and memory of their cor-
rect (not the best bet) classes. This change
in inference method was evidenced in the 
change from best bet to correct answers 
for the infirming cases but, of course, it 
was not reflected for confirming cards be-
cause for these cards the correct and best-
bet answers are the same.6

In addition to clarifying the response pat- 
tern for the infirming cards the analysis of 
individual response records revealed how 
ability to recognize individual deck cards 
was linked to cue probabilism and how the 
test card results were related to ability to 
recognize individual deck cards. It was 
found that some infirming cards were recog- 
nized (i.e., the correct answer was consist- 
ently given) earlier in training than others. 
The mean speed with which cards came 
to be recognized was correlated with the 
magnitude of the E(k) associated with
the cards’ correct classes. These correla-
tions are .93 (n = 9 cards, p < .001) for Ss
in Experiment I, .79 (n = 14 cards, p <
.001) for Group A, and .52 (n = 14 cards,
p = .07) for Group B (Spearman rank-
order correlation, two-tailed probabilities).
These results indicate that Ss’ use of recog-
nition as a method for making inferences
was not independent of cue probabilism.
When the E(k) for the correct class was
high, i.e., the class was not the best bet
but it was still fairly reasonable that the
card could belong to the class, Ss quickly
started to rely upon recognition of the card
in order to determine its class. When the
E(k) was low, i.e., when in terms of the 
model it was less probable that the card 
could belong to the class which was actually 


6 An unsuccessful attempt was made to account
for Ss’ inferences in terms of a progression from 
the use of cues as single elements, through the use 
of pairs as elements, to the use of all three cues as 
a single pattern. The existence of the second stage, 
use of pairs of cues, was not supported by the data. 




correct, Ss waited until later in training to 
utilize recognition.7
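The rank-order correlations reported above can be illustrated with a minimal sketch of the Spearman coefficient, computed here as the product-moment correlation of the ranks with tied values given their average rank. The E(k) values and recognition-speed scores in the example are hypothetical and are not the experimental data; a higher speed score is assumed to mean earlier recognition.

```python
def average_ranks(values):
    """Rank values from 1 upward; tied values share the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks

def spearman_rho(x, y):
    """Spearman rank-order correlation between two equally long sequences."""
    rx, ry = average_ranks(x), average_ranks(y)
    n = len(x)
    mx, my = sum(rx) / n, sum(ry) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    var_x = sum((a - mx) ** 2 for a in rx)
    var_y = sum((b - my) ** 2 for b in ry)
    return cov / (var_x * var_y) ** 0.5

# Hypothetical values for five infirming cards.
e_correct = [0.10, 0.15, 0.20, 0.25, 0.30]    # E(k) of each card's correct class
recognition_speed = [2, 1, 4, 3, 5]           # higher = recognized earlier

print(round(spearman_rho(e_correct, recognition_speed), 2))
```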

The impression should not be given that 
Ss used the model or recognition in an all- 
or-none way; indeed, there were great in- 
dividual differences in the degree to which 
recognition was utilized. The degree to 
which it was used can be estimated by the 
relative numbers of correct versus best-bet 
answers given for the infirming cards. Some 
Ss appeared to start to use recognition early 
in training; others used it only for those 
infirming cards for which the E(k) for 
the correct class was high. The nonrecog- 
nizing Ss, those who depended more heavily 
upon cue probabilism, tended to give the 
best bets as their responses to infirming 
cards all through training even though they 
were frequently wrong. These individual 
differences in the use of recognition provided 
the key to understanding the test card 
results. 

The test card results show that for some 
cards there were frequently two answers 
which were more commonly given. One of 
these was usually the theoretical best bet; 
the other was frequently the answer which 
was correct for a deck card which possessed 
cue values similar to the test card’s cues. 
This suggested that Ss who depended upon 
recognition in the training trials might 
have tended to base their test card answers 
on the similarity between the test card 
and a familiar card from the training deck. 
To investigate this, the proportion of cor- 
rect responses for infirming cards was used 
as an indication of the degree to which 
each S depended upon recognition during 
training. This was correlated with the pro- 
portion of S’s test card inferences which 
could be accounted for by the similarity 
between the test card and a card from the 
deck. These correlations were .71 (n = 


7 The finding that the speed of adopting recog-
nition is related to cue probabilism is further sup-
ported by the fact that the frequency of the occur-
rence of a given kind of card in the deck (e.g.,
there were six E52 cards, three F73 cards, etc.) does
not correlate with the speed of adopting recogni-
tion for either confirming or infirming cards.

8 To be considered similar to a test card a deck
card could have only one of the three cue values 
different from the cues on the test card. There 
were 5 of the 14 Experiment I test cards and 6 of 


10 Ss, p < .05) for Ss in Experiment I, 
.37 (n = 16 Ss, p ≈ .15) for Group A, and
.51 (n = 20 Ss, p < .05) for Group B
(Spearman rank-order correlation, two- 
tailed probabilities). These correlations sup- 
port the hypothesis that Ss who rely on 
recognition during training also tend to 
make “similarity-based” inferences for the 
test cards rather than “model-based” in- 
ferences. It is also important to note that 
the test cards did not have to be similar 
to infirming cards to receive similarity- 
based responses; similarity to confirming 
cards also influenced Ss’ inferences. This 
implies that the familiarity provided by 
the training trials permitted these Ss to 
remember both the confirming and the in- 
firming cards and to utilize this information 
for their test card inferences. 
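The similarity criterion used in this analysis (a deck card counts as similar to a test card when no more than one of its three cue values differs) can be sketched as follows. Cards are represented as (letter, number, pointer) tuples; the deck, the classes, and the test card below are hypothetical.

```python
def n_differences(card_a, card_b):
    """Number of cue dimensions on which two cards differ."""
    return sum(a != b for a, b in zip(card_a, card_b))

def similarity_based_answers(test_card, deck):
    """Classes of deck cards that differ from the test card on at most one cue.

    deck maps a (letter, number, pointer) tuple to the card's correct class.
    """
    return {cls for card, cls in deck.items() if n_differences(card, test_card) <= 1}

# Hypothetical deck cards and test card.
deck = {
    ("E", 5, 2): "red",
    ("F", 7, 3): "blue",
    ("A", 1, 9): "green",
}
test_card = ("E", 5, 4)   # differs from ("E", 5, 2) on the pointer only

print(similarity_based_answers(test_card, deck))
```

An S's test card inference would then be scored as similarity-based when it falls in this set, and as model-based when it equals the best bet instead; the two can be told apart only for test cards on which these answers differ.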

Probability Matching and Theory. The 
paradigm for probability learning studies 
is that Ss are given a ready signal and asked 
to predict which of two or more alternative 
events will occur. The ready signal may 
be a light, a buzzer, a verbal “ready,” or 
the like; and the events may be different 
lights on a display, symbols on cards, digits 
written on a blackboard, etc. The events
occur randomly and with different relative 
frequencies. 

It can be argued that the present theory 
is applicable to the probability learning
situation; that the ready signal is a one- 
valued cue, d = 1, and that the subsequent 
events are classes, k = a through z. In
this situation the theory would predict 
that Ss should tend to maximize, i.e., to 
always select the most frequently occurring 
event. This, of course, is not what usually 
happens. In most of the probability learning 
studies Ss’ frequency of response approxi- 
mates the relative frequency of occurrence 
of the corresponding events. 
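The difference between maximizing and probability matching can be made concrete by comparing the expected proportion of correct predictions under each strategy. The sketch below assumes independent events with fixed probabilities; the probabilities used are hypothetical.

```python
def expected_accuracy_maximizing(probs):
    """Always predict the most frequent event."""
    return max(probs)

def expected_accuracy_matching(probs):
    """Predict each event with a frequency equal to its probability of occurrence."""
    return sum(p * p for p in probs)

# Hypothetical event probabilities for a three-alternative situation.
event_probs = [0.7, 0.2, 0.1]

print(expected_accuracy_maximizing(event_probs))
print(round(expected_accuracy_matching(event_probs), 2))
```

Because the sum of squared probabilities can never exceed the largest probability, matching is never the more accurate strategy under these assumptions, which is why its frequent occurrence in probability learning studies requires explanation.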

It is possible that differences between 
the experimental situations in the present 
research and in the probability learning 
studies make the theory inapplicable to the 
latter. However, it is possible that the 
distributions of responses for cards in the 





the 12 Experiment II test cards for which the 
model-based best bet and the answer dictated by 
similarity to a deck card were not the same answer.

present experiment result from Ss’ tendency
to match their response frequencies to the 
theoretical probabilities. Therefore, the fol- 
lowing analysis was performed. 

If Ss tend to maximize rather than to 
match probabilities, the proportion of Ss 
selecting the best bets for the test cards 
should be significantly higher than the 
best bets’ probabilities. Using those test 
cards which had nonchance response dis- 
tributions, this difference was significant at 
or below the .05 level for all three groups 
of Ss (Mann-Whitney U test, one-tailed). 

By the same reasoning, if Ss are match- 
ing probabilities rather than maximizing on 
the training cards, the response curves for 
the best bet (see Figures 2-4 and 6-10) 
should approximate the average of the 
best-bet probabilities for the corresponding 
cards. For Experiment I these averages are 
.69 for the best bet for the confirming 
cards, .50 for the best bet for the infirming 
cards, and .26 for the correct class for the 
infirming cards. For Experiment II these 
averages are .54 for the best bet for the 
confirming cards, .39 for the best bet, and 
.22 for the correct class of infirming cards. 
Comparison of the curves in the figures with 
these values provides no support for the 
probability matching hypothesis. Thus, for 
both test cards and for the training cards 
there is no evidence of Ss’ responses match- 
ing the theoretical probabilities. 


DISCUSSION 


In general, the present theory appears 
to provide a promising line of inquiry. The 
results indicate that for unfamiliar objects 
cue probabilism is an important determi- 
nant of Ss’ inferences and that with in- 
creased familiarity the use of recognition 
and of similarity becomes important. This 
conclusion is supported, first, by the curves 
representing the infirming cards. Figures 
4, 9, and 10 show that early in training the
theoretical best bet was always given by 
the highest proportion of Ss. Later, as 
training proceeded, and Ss supposedly be- 
came able to recognize the cards, the re- 
sponse curve for the correct class rose and 
exceeded the curve for the best bet. 

The test card results also support the 
conclusion. The response distributions for 


the test cards (see Tables 2 and 4) fre- 
quently have two high points: one cor- 
responding to the theoretical best bet, and 
the other corresponding to the “similarity- 
based” response. Unless they are the simi- 
larity-based responses, the classes having 
the lowest theoretical E(k) are infrequently 
chosen. 

The influence of cue probabilism on Ss’ 
utilization of recognition is demonstrated by 
Ss’ individual response records. These 
records reveal that the magnitudes of the 
E(k) values derived from the model cor- 
respond to the order in which Ss adopt 
recognition for classifying infirming cards. 
The individual response records also showed 
that Ss who depended most heavily upon 
recognition during training also depended 
most heavily upon similarity in responding 
to the test cards. The latter finding indicates 
that there are individual differences in how 
Ss utilize the various alternative inference 
methods; differences which are perhaps 
linked with motivation, intelligence, and 
other such factors. 

The finding that the present results do 
not indicate that Ss were matching proba- 
bilities may be due to some fundamental 
difference between the present experimental 
situation and the usual probability learning 
situation; a difference which causes Ss to 
use different strategies, decision rules, sets, 
or the like, in the two situations. When dis- 
cussing the differences between the two 
situations it is well to keep in mind that 
neither situation is as simple as it first 
appears. A number of studies (e.g., Cotton & 
Rechtschaffen, 1958; Gardner, 1958; Hake 
& Hyman, 1953; Jarvik, 1951; Jones, 1961;
McCormack, 1959) have demonstrated vari- 
ous factors which contribute to the under- 
lying complexity of the probability learning 
situation. No doubt similar analyses would 
prove the present situation to be equally 
complex. One important aspect of the proba- 
bility learning situation is Ss’ tendency to 
ignore the independent existence of each 
trial (Hake & Hyman, 1953; Jarvik, 1951). 
Instead the experiment apparently is looked 
upon as a unit in itself of which one aspect, 
the sequence of outcomes, is variable. The 
variable outcomes are seldom assumed to be
random, however, and Ss are particularly
alert for real or imagined patterns in the 
sequence. Attention to patterns is not sur- 
prising because Ss seldom encounter purely 
random sequences of highly similar, cue- 
impoverished events in the normal environ- 
ment. When such a sequence does occur, as 
in gambling or perhaps in the patterns of 
lights which flicker on and off on warning 
signals, advertising signs, or strings of 
flickering Christmas tree lights, there is a 
very strong tendency to impose order upon 
the events. The imposed order is probably 
not independent of whatever parameters 


determine the occurrence of the events, but 
it is keyed, nonetheless, to the sequence 
rather than to the individual events as 
distinct Os. 

In the present experiment an effort was 
made to prevent Ss from relying upon the 
order of Os’ occurrence in the deck. The 
Ss were instructed to think of each card as 
an individual O (cf. page 7), and sub-
sequent comments made by various Ss in-
dicated that for the most part they followed
the instructions. These instructions may ac- 
count for the apparent lack of probability 
matching in the present results.


OF OBJECTS¹

The present theory, based on the results of the author’s previously reported
research, concerns the ways in which Ss make inferences about what objects 
(Os) are, i.e., about their class memberships, and about their covert prop- 
erties, i.e., about their unknown cue values. Mechanisms and conditions are 
described for 3 inference methods: for familiar Os (recognition), for unfa- 
miliar Os which are similar to a familiar O (assimilation), and for unfamiliar 
Os (identification). In addition, the way in which this knowledge is retained 
and how it is revised in light of new information are also discussed. 


THE course of a man’s life is intricately
interwoven with the multitude of per-
sons, things, and events which he encoun-
ters, some repeatedly and some only once.
Interaction with these objects (Os) pre- 
sumes knowledge about them, knowledge 
which allows the subject (S) to plan his 
part of the interaction properly and to have 
some notion of how O will react to his ac- 
tions. Knowledge about Os can be divided 
into two general categories: (a) knowledge 
about what kind of O it is, i.e., the general
class of Os to which it belongs; and (b)
knowledge about the nature of O, i.e., O’s
properties or characteristics. When an O is 
first encountered S obtains some informa- 
tion about it through immediately apparent 
characteristics, such as contour, color, and 
sound, from which he attempts to determine 
the kind of O it is. Then the knowledge 
about what O is, together with the knowl- 
edge about its immediately apparent char- 
acteristics, can be used to decide about O’s 
covert properties. The present theory is con- 
cerned with how Ss utilize information 


1 Opinions or conclusions contained in this report 
are those of the author and not to be construed as 
necessarily reflecting the views or the endorsement 
of the Navy Department. The valuable help and 
advice of K. R. Hammond, M. Wertheimer, M. B. 
Jones, J. R. Berkshire, D. E. Bailey, Alice Garman 
and Elizabeth McPeters is gratefully acknowl- 
edged. This work was done at the United States 
Naval School of Aviation Medicine, Pensacola, 
Florida. 


about the known attributes of Os in order 
to make inferences about Os’ unknown at- 
tributes. 

In brief, the theory assumes that each S 
possesses a multidimensional cognitive space 
which is defined by the cue dimensions to 
which he attends. All of the cue values S 
knows to be possessed by O define a loca- 
tion in this space, the cognitive representa- 
tion of O, called 6. If, when an O is referred 
to the space, a @ already exists at or ex- 
tremely near the location, O is said to be 
recognized as the previously experienced O 
corresponding to 6. Then the cue values and 
the class associated with 6 can be assumed 
to be applicable to the recognized O. If the 
location defined by O’s cue values falls 
somewhat further from a 6 in the space, S 
can still assume 6 and O to be highly simi- 
lar in class membership and in covert cue 
values. Here the class and covert cue values 
associated with the similar 6 can be assimi- 
lated for O and assumed to be sufficient un- 
til further information indicates the con- 
trary. If the location defined by O’s cue 
values does not fall close to a 6 in the space, 
S can fall back on a third method called 
identification, which relies on the statistical 
properties of groups of #s which possess cue 
values similar to the ones O is known to 
possess. These three inference processes 
comprise the major portion of the theory. 
The remainder consists of the methods by 
which the inferences are checked against 
subsequent information and by which re-
visions are made in an effort to maintain
accuracy.
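The three inference methods just outlined can be sketched as a single decision procedure. Everything specific in the sketch is an assumption made for illustration: the Euclidean distance measure, the two distance thresholds, and the simple count of shared cue values that stands in for the "statistical properties" used in identification. The theory as summarized here does not fix any of these details, and the stored representations are hypothetical.

```python
from collections import Counter
from math import dist

def infer_class(o, stored, recognize_within=0.5, assimilate_within=2.0):
    """Return (method, class) for an object o given stored representations.

    o is a tuple of cue values; stored is a list of (cue_values, class) pairs.
    Both thresholds are hypothetical.
    """
    nearest_cues, nearest_class = min(stored, key=lambda item: dist(o, item[0]))
    distance = dist(o, nearest_cues)
    if distance <= recognize_within:
        return "recognition", nearest_class
    if distance <= assimilate_within:
        return "assimilation", nearest_class
    # Identification: pool the stored representations that share cue values with o.
    votes = Counter()
    for cues, cls in stored:
        votes[cls] += sum(a == b for a, b in zip(o, cues))
    return "identification", votes.most_common(1)[0][0]

# Hypothetical contents of a three-dimensional cognitive space.
stored = [((1, 1, 1), "red"), ((1, 2, 1), "red"), ((9, 9, 9), "blue")]

print(infer_class((1, 1, 1), stored))
print(infer_class((1, 1, 2), stored))
print(infer_class((1, 5, 9), stored))
```

The ordering of the branches reflects the account given in the text: recognition is tried first, assimilation applies when a stored representation is near but not coincident, and identification is the fallback for objects far from every stored representation.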


ATTRIBUTES OF Os 


In their Study of Thinking Bruner, Good- 
now, and Austin (1956) define an attribute 
of an O as 


any discriminable feature of an [O] that is sus- 
ceptible of some discriminable variation from [O] 
to [O] [p. 26]. 


These authors also quote Boring’s (1942) 
similar definition: 


A stone is shape, color, weight, and kind of sub-
stance in complicated relation. When such de-
scriptive ultimates are general properties which
can vary continuously or discretely, when they are, 
in short, parameters, they may, if one chooses, be 
called attributes of the object described [p. 25]. 


As these definitions indicate, Os are de- 
fined by their attributes, and attributes are 
those aspects of Os which permit discrimina- 
tions among them. It is assumed that there 
are two types of attributes associated with 
Os, classes and cue values. 


Classes 


A class is an attribute of Os which per- 
mits only dichotomous (presence or ab- 
sence) judgments about its association with 
any one O. It is a nominal attribute of an 
O which does not permit discriminations 
among Os which possess it. For example, 
knowing that two Os are members of the 
class “cow” does not aid in discriminating 
between them—discriminations are based 
upon other attributes such as color, size, and 
markings, which are cues. Classes exist as 
a result of the psychological operation of 
putting Os together into various groups and 
of assigning these groups specific names. 
While they can be subdivided on the basis 
of discriminations made on cue dimensions, 
e.g., red houses and white houses, the class 
still remains the group of Os to which the 
discrimination refers, and at no time can 
various members of one class be discrimi- 
nated from one another on the basis of 
knowledge of the class alone. 


Class Hierarchy 


It is obvious that an O is associated with 
many different classes and that knowledge 
of its membership in one class often implies 
membership in a number of other classes. 
This is because classes exist in a hierarchy, 
which runs from an infinite number of small 
classes (each containing only one O) at the 
bottom on up through classes which are 
fewer and fewer in number, but increasingly 
larger in size. At each level of the class 
hierarchy the classes are all mutually ex- 
clusive. 

At each level the set of Os (or, rather, θs
because they are already occupants of the
cognitive space) which S has previously ex-
perienced is partitioned into classes. These
classes, or parts of classes, can be combined
with other classes, or parts of classes, on the
same level to form higher level classes. For
example, a θ may be a member of the class
“cottage” which can be combined with the
classes “apartment,” “house,” “mansion,”
etc., and subsumed under the higher order
class “residence.” This in turn can be com-
bined with “store,” “depot,” “bank,” and the
like, to form the yet higher order class
“structure,” and on and on.

[Figure 1 not reproduced; axis label: hierarchy level.]

Fig. 1. The hypothetical structure of the cogni-
tive space at various levels of the class hierarchy.
(At the lowest level each class contains one θ and
at the highest level one class contains all θs.)

Class Function. In the course of S’s past
experience with a θ he has experienced it as
a member of a number of different classes.
This is due to alternative names for the
same class, errors in labeling, multiple-class
memberships (such as a man belonging to
religious, professional, and social groups),
etc. As a result it is not always clear to
which class a given θ belongs at the time the
information is needed to aid an interaction,
and thus a decision must be made about θ’s
membership.

The rule for determining which of the
k = a through z classes at a specific hier-
archy level will be assigned to a particular
θ is called the class function. The class func-
tion is in turn determined by two variables:
the relative frequency with which θ and a
particular class have been associated and
whether or not the class is the one with
which θ was last associated. These two var-
iables are functions on the set of classes for
each θ and their relative importance can be
treated as weights on the functions.² For
each θ the sum of these two weighted func-
tions on the set of classes defines the class
function. The class for which the sum is
greatest should be the one assigned to the
θ under consideration.
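The class function described above can be sketched in a few lines. This is only an illustration under stated assumptions: the weights `w_freq` and `w_last` are hypothetical (the theory does not give their values), the history of one θ is assumed to be a simple list of past class labels, and the name `class_function` is ours.

```python
from collections import Counter

def class_function(history, w_freq=1.0, w_last=0.5):
    """Return (best_class, scores) for one theta.

    history: past class labels for this theta, oldest first (assumed format).
    w_freq, w_last: illustrative weights on relative frequency and on
    "was the last associated class"; the theory leaves them unspecified.
    """
    if not history:
        raise ValueError("theta has no class history at this level")
    counts = Counter(history)
    total = len(history)
    last = history[-1]
    scores = {
        k: w_freq * (counts[k] / total) + w_last * (1.0 if k == last else 0.0)
        for k in counts
    }
    # The class with the greatest weighted sum is assigned to theta.
    return max(scores, key=scores.get), scores

history = ["cottage", "cottage", "house", "cottage", "house"]
best, scores = class_function(history)
best_low_recency, _ = class_function(history, w_last=0.1)
```

With a large weight on the last association the most recent label wins; with a small one the most frequent label wins, which is the trade-off the two variables are meant to capture.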

2 The importance of the last associated class or
cue value should decrease as the time between the
last encounter and the present encounter increases.
This lends consistency to the assignment of classes
and cues to the θs by insuring that previous infer-
ence errors, temporary changes in the O, etc., will
have less influence as time goes on and that the
highest relative frequency will exert more and more
influence on the assignment.

Inference within the Class Hierarchy.
When one of θ’s class memberships is known,
its memberships at many of the higher levels
in the hierarchy can be inferred on the basis
of the proportion of θ’s fellow class members
known to belong to each of the higher order
classes. Similarly, θ’s lower order classes can
be inferred from the proportion of θs from
each of the lower order classes which are
known to constitute the class to which it be-
longs. Because these proportions can be used
to make such intrahierarchy inferences it
is important to note that, because there are
fewer classes at higher than at lower levels,
a large proportion of the members of O’s
known class will usually go into one higher
order class. On the other hand, O’s known
class is usually made up of a number of
lower order classes. This means that infer-
ences from O’s known class downward in
the hierarchy usually will be based on more
rectangular distributions of proportions
than will inferences made upward in the
hierarchy. Consequently, downward infer-
ences are frequently more uncertain than
are upward inferences.
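The asymmetry between upward and downward inferences can be illustrated with a toy set of θs. The data, the two-level layout, and the helper name `proportions` are assumptions made for this sketch, not part of the theory.

```python
from collections import Counter

# Hypothetical thetas, each with a lower-level and a higher-level class.
thetas = [
    ("cottage", "residence"), ("cottage", "residence"), ("cottage", "residence"),
    ("house", "residence"), ("house", "residence"),
    ("mansion", "residence"),
    ("bank", "commercial"), ("store", "commercial"),
]

def proportions(labels):
    """Relative frequency of each label in an iterable of labels."""
    counts = Counter(labels)
    total = sum(counts.values())
    return {k: c / total for k, c in counts.items()}

# Upward: the lower class "cottage" is known; infer the higher class.
upward = proportions(high for low, high in thetas if low == "cottage")

# Downward: the higher class "residence" is known; infer the lower class.
downward = proportions(low for low, high in thetas if high == "residence")
```

Here every cottage falls in one higher order class, so the upward distribution is concentrated on a single value, while the downward distribution is spread over three lower order classes and its best bet is correspondingly less certain.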

Selection of a Hierarchy Level. When an 
O is encountered in the environment S must 
select a level in the class hierarchy at which 
to attempt a class inference (i.e., he must 
select the set of classes to be considered for 
O’s membership). While the lowest level 
possible would be best, because this permits 
intrahierarchy inferences of O’s membership 
in a number of higher order classes, this is 
seldom possible. To classify an O as a mem- 
ber of a very low level class it is necessary 
to know a good deal about O’s attributes, 
because lower level classes are more tightly 
defined than higher order classes, e.g., struc- 
ture versus cottage, with the lower levels 
containing only a few very similar Os in 
each class. Because S seldom possesses much 
information at the moment he is forced to 
make his decision he usually must settle for 
a higher order class. 

A second condition which tends to make 
S select a high level for the class inference 
is that the decision about the specific class 
is more likely to be correct at higher levels. 
(There is a smaller possibility of error in 
assigning an O to the broad class, structure, 
than in deciding it is a member of the 
smaller, more tightly defined class, cottage.)
These two factors, lack of information about 
O and desire to be correct in classifying it, 
tend to make S select a high level in the 
class hierarchy for deciding O’s class. 

On the other hand, after S decides what 
O is, he usually intends to make further in-
ferences about its cue values. These cue in-
ferences are based on knowledge he has
about O’s fellow class members. As a result, 
he does best to select a lower order class 
initially because such classes are smaller 
and are composed of more homogeneous Os. 

In general then, the higher the order of 
the class the more likely is the identification 
of O to be correct; the lower the order of 
the class the more likely are subsequent cue 
inferences to be correct. The class level at 
which these two factors balance one another 
will be the one selected for the identification 
of an unfamiliar O. The mechanism for de- 
termining where these factors balance will 
be more closely examined later in the dis- 
cussion. 


Cue Dimensions 


The classes are defined as attributes for 
which only two values exist: presence or 
absence. For some attributes, however, it 
is possible to discriminate among Os not 
only on the basis of the presence or absence 
of an attribute, but also on the basis of the 
degree to which the attribute is present. For 
example, some Os not only possess the at- 
tribute of weight but some are heavier or 
lighter than others. When such discrimina- 
tions can be made among Os possessing an 
attribute it qualifies as a cue dimension and 
the degree to which each O possesses one of 
these attributes is O’s value on the cue di-
mension. (A rule of thumb for distinguishing
cues and classes: Proper nouns, common 
nouns, collective nouns, and personal pro- 
nouns are classes while abstract nouns, ad- 
jectives, participles, and adverbs are usu- 
ally cue values.) 


Cue-Dimension Hierarchy 


Just as classes exist in a hierarchy so too 
is there a hierarchy of cue dimensions. The 
cue-dimension hierarchy extends from the 
higher order, culturally defined attributes, 
like beauty or status, on down to the sen- 
sory level. Most of the meaningful cue val- 
ues we associate with Os are only remotely 
related to the original sensory stimulation. 
Stimuli emanating from O stimulate S’s re-
ceptors and produce sensations, e.g., loud-
ness, light, color, heat, etc. These lowest
order cues are then used to infer slightly 


higher order cues such as edge, texture, sur- 
face, and size. In turn, these cues lead to the 
inference of even higher order cues such as 
square corners, planes of surface, orienta- 
tion in space, extension, and so on. This pro- 
cedure leads S up the cue-dimension hier- 
archy until he finally arrives at a point at 
which he has enough information both to 
infer O’s class and to place its cognitive 
representation (its θ) appropriately in the
cognitive space. After this point, knowledge
of both the class and the cue values deter-
mines subsequent inferences about the yet
higher order cue values.

Cue Function. The method of assigning a
cue value to a θ is parallel to that used for
assigning classes. This rule, called the cue
function, treats the cue values from one di-
mension as a set of elements which are to
be assigned to each θ in the cognitive space.
Because of temporary and permanent
changes in O over time, variations in view-
ing conditions, etc., any θ may have had a
number of associated values from the di-
mension. This history generates a relative
frequency distribution for the cue values
which, together with the last value asso-
ciated with θ, determines the cue value to
be attributed to θ. These variables can be
treated as functions on the cue space and
weighted for relative importance. The cue
value for which the sum of the weighted
values of these two factors is greatest is the
value which S should assign to the θ under
consideration.

Class and the Cue-Dimension Hier-
archies. The similarity between the class
hierarchy and the cue-dimension hierarchy
is not so great as it at first appears. The
class hierarchy, as illustrated in Figure 1, is
composed of sets of classes into which θs are
repartitioned at every level of the hier-
archy. Knowledge of its class at one level
permits intrahierarchy inferences about θ’s
class memberships at other levels of the
hierarchy. On the other hand, the cue hier-
archy does not consist of the reassignment of
elements at each level. The levels of this
hierarchy only indicate the abstractness or
“remoteness” (Brunswik, 1956) of the vari-
ous cue dimensions. The θ’s value on each
succeeding higher order cue dimension must
be inferred from the lower order cue values.
COGNITIVE SPACE


The assumed structure of the cognitive 
space is similar to that of analogous con-
cepts proposed by Kelly (1955); Osgood,
Suci, and Tannenbaum (1957); Overall and
Williams (1961); Sarbin, Taft, and Bailey
(1960); and others. It is a multidimensional
space which is defined by the cue dimensions 
to which S attends. 

The cognitive space provides an informa-
tion storage and retrieval system for Os
which S encounters. Each cue value asso-
ciated with an O can be located on the ap-
propriate cue dimension bounding the space.
All of these values define a particular point,
called θ, which is unique to O and which is
the key both to retention of information
about O and to retrieval of this knowledge
when it is needed. The θ serves to retain
knowledge through its location because its
location is a summary of everything S
knows about O’s cue values. It serves in the
retrieval of knowledge about cue values be-
cause specification of its location designates
the appropriate θ and permits the deter-
mination of its cue values through the cue
function. It serves in the retrieval of knowl-
edge about O’s class membership because θ
is linked to the classes by the class function.


INFERENCE OF CLASS MEMBERSHIP 


Recognition


When an O appears before S, only a few
lower level cues are apparent. Using these
cues, S’s first task is to decide whether or
not this O has ever been experienced before,
i.e., whether it is familiar or unfamiliar. If it
is recognized as familiar, a great deal may al-
ready be known about it and this knowledge
can be utilized in the ensuing interaction.
Because of changes in situational condi-
tions, however, as well as changes in an O
over time, the immediately apparent cue
values associated with an O may be slightly
different from one encounter to the next.
The S’s problem is to decide whether the cue
values associated with a θ in the space are
sufficiently similar to those associated with
the present O to permit the assumption that
θ derives from a previous encounter with
this same O.

When an O is encountered the first step in
the inference process is to determine at
which level of the class hierarchy the infer-
ence will be made. That is, S must select the
set of classes which he will consider as pos-
sible candidates for O’s membership. The
selection is determined by S’s desire to be
correct in his class inference, his desire to
be correct in his subsequent cue inference
(cf. section on Selection of a Hierarchy
Level), and the context in which the infer-
ence is being made.

The second step in the inference process
is to use O’s immediately apparent cue val-
ues to tentatively define a cell in the cog-
nitive space, called φ. Next, the distance is
measured between φ and every θ in the cog-
nitive space.³ The smaller the distance be-
tween φ and the nearest θ, the more likely it
is that they both derive from the same O.
There is, however, some critical distance be-
yond which S will reject the hypothesis that
both φ and the nearest θ derive from the
same O and will therefore conclude that O
is not recognized. (See Ames, 1949, and
Arnoult, 1956, concerning Ss’ assumption of
identity between similar Os.) The size of the
critical distance is determined by S’s moti-
vation to be correct in his inferences. If it is
very important to be correct, the size of the
critical distance will be very small and the
nearest θ will have to be nearly identical to
φ to be accepted. If accuracy is less impor-
tant, greater dissimilarity can be tolerated.⁴


3 No specific method has been given for meas-
uring distances in the cognitive space because so
many alternatives are available and, for research,
the final choice would depend on the conditions of
the experimental situation. One method might be
to use D² (Cronbach & Gleser, 1953; Osgood &
Suci, 1952) computed for the cue dimensions com-
mon to φ and each θ. Similarly, variations of the
scaling procedures developed for decision theory
could be used, and scaling techniques similar
to those developed by Ekman, Goude, and Waern
(1961) might also be of use.

The decision paradigm, as well as the concept
of the critical value fluctuating as a function of mo-
tivation, closely resembles statistical decision the-
ory and is based on the Tanner and Swets (1954)
approach to stimulus detection.

4 When a number of θs are all the same distance
from φ and all lie within the critical range around
it, Ss apparently match the relative frequency of
their choices of each θ to the relative frequency
with which they have previously seen each θ,
i.e., probability matching (Binder & Feldman,
1960). In the laboratory S is not permitted to ob-
tain additional cues. Ordinarily, however, another
cue value would be sought in order to break the
deadlock.

Fig. 2. Probability of use of each inference
method as a function of the distance between θ
and φ.

Another determinant of whether or not
one θ will be accepted as having derived
from the O in question is S’s “familiarity” with
the θ (Arnoult, 1956; Nobel, 1954). If one
θ is more familiar than another, even though
their distances from φ are equal, S should
be more inclined to accept the former as
having derived from the present O.

Familiarity is primarily determined by
the frequency with which θ has been en-
countered. However, it is somewhat more
than just frequency; it is also influenced by
the duration of the encounters and how
much S knows about θ’s various class mem-
berships and cue values. In some cases, of
course, this reduces primarily to frequency
of past encounters, and in these cases the
influence of frequency of past encounters
upon recognition is well documented by a
sizable body of literature (e.g., Arnoult,
1956; Bruner, 1957; King-Ellison & Jen-
kins, 1954; Postman & Rosenzweig, 1956;
Solomon & Postman, 1952). However, a
more general definition of familiarity is
needed for our present purposes. Here fa-
miliarity is defined as the sum of the num-
ber of times a θ has been classified and the
number of cue values known for it. Thus,
while familiarity is closely related to rela-
tive frequency of encounter, it also takes
total knowledge into account.

When it is decided that φ lies close
enough to a θ to assume that they derive
from the same O, the class function serves
to assign a class to θ and, in turn, to O. Then
S can assume O is a member of the class
and proceed with the interaction. If θ has
never before been assigned a class on that
class hierarchy level, S has two courses of
action open to him. First, he can select a
higher or lower hierarchy level and attempt
to assign one of the classes on the new level.
Then he can make intrahierarchy inferences
up or down to the level at which he pre-
viously wanted to make the inference.
Second, if the intrahierarchy inference pro-
cedure is too involved or does not yield suffi-
ciently certain inferences, S can fall back
upon the identification method which will
be explained below.


Assimilation 


When the distance between φ and the
nearest θ in the cognitive space is greater
than the critical value, S can assume that
they do not derive from the same O. How-
ever, the matter does not end there. If φ and
θ are reasonably similar, it is prudent to
consider whether this similarity is not par-
alleled by a similarity in class membership
and in as yet uninferred cue values. Thus,
even though O is probably not the same one
which has been encountered before, θ’s class
still may be appropriate because θ and φ
are so similar in their other aspects. When
S can clearly discriminate between θ and φ
and yet he adopts θ’s class for φ, and there-
fore O, the process is called assimilation.

The hypothetical curves in Figure 2 de-
scribe the effect of the distance between φ
and the most similar θ upon S’s choice of
an inference method. First, when familiarity
is held high and constant, up to the critical
distance, φ and θ are assumed to derive from
the same O and the present O is recognized
as belonging to the class associated with θ.
Just beyond the critical distance φ and θ
are still extremely similar and even though
S does not recognize O he is still likely to
assimilate θ’s class for O. As the distance in-
creases, the adoption of the most similar θ’s
class for O becomes less likely until at large
distances it becomes extremely unlikely,
and at the same time the probability in-
creases that S will use the identification
method which will be discussed presently.

In Figure 3 are presented the hypothetical
relationships between familiarity with θ and
the probability of S using recognition, as-
similation, or identification to make his in-
ference. These two curves are from a family
of curves which change as a function of the
difference between φ and θ defined in Fig-
ure 2. If one placed the right margin of Fig-
ure 3 against the left margin of Figure 2,
the result would be a three-dimensional
graph of the family of curves.

When the distance between θ and φ is
held small and constant, an increase in S’s
familiarity with θ results in an increase in
the probability that S will use assimilation
or recognition to infer O’s class member-
ships, and a decrease in the probability that
he will use identification. On the other hand,
a larger distance between θ and φ results in
a lower assimilation-recognition curve and
a higher identification curve with corre-
sponding changes in probabilities for any
given frequency value.


Identification 


When an O’s cue values define a φ which
is too different from the nearest θ for either
recognition or assimilation, S must obtain
the class through identification. Unlike as-
similation, identification is based upon a
large number of θs which are similar to O,
rather than upon just one. Inferences based
on a large sample of similar θs are more
likely to be correct than those based on just
one previously experienced θ. So, when S’s
inferences for unfamiliar Os require a high
degree of accuracy, assimilation is likely to
be abandoned and identification used to in-
fer Os’ class.

The mechanics of the identification proc-
ess were outlined in the previous mono-
graph. To refresh the reader’s memory,
however, the identification process, like the
recognition and assimilation processes, de-
pends upon the immediately apparent cue
values associated with the O to be identified.
Each of these values has been experienced
by S in the course of his past experience
with other Os, Os which were members of
classes and which are represented by θs in
the cognitive space. Given a specific level
of the class hierarchy at which to work,
each cue value associated with O defines a
relative frequency distribution composed of
θs which possess the cue value, d, and which
are members of each of the different
classes, k = a through z. For
each cue value the proportion of θs in each
class, written P(k/d), constitutes 1/n of
the evidence for the class in question being
the best choice for S’s inference. The sum of
the P(k/d)/n over all of O’s known cue values
for each of the k = a through z classes gives
the proportion of the total evidence yielded
by the n cue values that indicates that each
class is the best bet for O’s membership.

\[ E(k) = \sum_{d=1}^{n} \frac{P(k/d)}{n} \qquad [1] \]

Where,

k = a specific class under consideration
as a possibility for O’s identity; k = a
through z

d = a dimension on which O’s cue value
is known; d = 1 through n

The S should choose the class with the
highest E(k) for his inference about O’s
class. When this inference has been made φ
takes on the class as an attribute and be-
comes a permanent θ in the cognitive space.
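Equation 1 can be computed directly from a table of stored θs. The sketch below is a minimal illustration, assuming each θ is stored as a class label plus a dictionary of cue values; the data and the name `identification_evidence` are hypothetical.

```python
from collections import defaultdict

def identification_evidence(known_cues, thetas):
    """Compute E(k) = sum over d of P(k/d) / n for each class k.

    known_cues: dict of dimension -> value observed for the present O.
    thetas: list of (class_label, cue_dict) pairs (assumed storage format).
    """
    n = len(known_cues)
    evidence = defaultdict(float)
    for dim, value in known_cues.items():
        # Classes of all stored thetas that share O's value on this dimension.
        matching = [k for k, cues in thetas if cues.get(dim) == value]
        if not matching:
            continue  # no theta has this cue value; it contributes no evidence
        for k in set(matching):
            evidence[k] += (matching.count(k) / len(matching)) / n
    return dict(evidence)

thetas = [
    ("a", {"color": "red", "shape": "round"}),
    ("a", {"color": "red", "shape": "square"}),
    ("b", {"color": "red", "shape": "square"}),
    ("b", {"color": "blue", "shape": "round"}),
]
E = identification_evidence({"color": "red", "shape": "square"}, thetas)
best_class = max(E, key=E.get)
```

In this toy case "red" favors class a by 2/3 to 1/3 while "square" is split evenly, so the summed evidence favors class a.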

Some Properties of the Equation. The
simple additive form of Equation 1 was de-
liberately selected because there is some evi-
dence that it may be sufficient and because,
while it is possible that other, more complex
forms will prove to be more adequate, it is
probably best to start with the simplest as-
sumptions and then complicate the theory
where need be. (Anderson, 1961, successfully
used a similar additive equation to predict
Ss’ impressions of Os based on descriptive
adjectives.)

Fig. 3. Probability of the use of each inference
method as a function of familiarity with the θ.

Equation 1 has two particularly interest-
ing properties. First, if any of the P(k/d)
= 0, the equation does not reduce to E(k)
= 0 as it would if the equation were multi-
plicative. This seems to agree with common
sense. Just because the cue value has never
before been associated with a member of a
particular class does not mean that it is im-
possible that the present O belongs to the
class; this may just be the first time S has
encountered this situation.
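A small numerical contrast makes the first property concrete. The three P(k/d) values below are invented for illustration, and the multiplicative form is shown only as the alternative the text argues against.

```python
import math

# Hypothetical P(k/d) values for one class k over n = 3 cue dimensions.
# The second cue value has never been seen with a member of class k.
p_k_given_d = [0.8, 0.0, 0.7]
n = len(p_k_given_d)

# Additive form of Equation 1: the zero term simply adds nothing.
additive = sum(p / n for p in p_k_given_d)

# A multiplicative alternative, for contrast: one zero removes all evidence.
multiplicative = math.prod(p_k_given_d)
```

Under the additive form class k keeps the evidence contributed by the other two cues, whereas the product collapses to zero.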

A second interesting property of the equa-
tion is that it is not necessary to weight the
various P(k/d)/n according to the ability of
their corresponding cue dimensions to dis-
criminate among the classes. Such a weight-
ing procedure would be used in a correla-
tional analysis (Hoffman, 1960; Todd,
1954), but is unnecessary here because the
P(k/d)/n will be nearly the same for every k
for a nondiscriminating cue dimension. For
discriminating cue dimensions (i.e., highly
correlated with the classes, e.g., Summers,
1962) the various values of P(k/d)/n will be
grossly different for different k’s; this, after
all, is what produces the correlation. How-
ever, with appropriate changes in the nu-
merator, the equation does leave open the
possibility of adding weights to the P(k/d)/n
in order to take into account the salience of
the various cue dimensions.⁵ In the present
treatment we will ignore this possibility to
simplify discussion.

5 In the further examination of the determina-
tion of E(k) it may be necessary to consider only
the most salient cues and those which have non-
rectangular distributions on the classes. This would
limit the size of n and thereby prevent the values
of all of the E(k) from becoming indiscriminably
small.

Theory and Data. The hypothesized na-
ture of the class inference methods and the
contents of Figure 2 are based upon the re-
sults of the study reported in the preceding
monograph. In that study it was found that
in the early stages of training, when Os were
unfamiliar to Ss, inferences corresponded to
the identification method’s maximum E(k).
As training progressed, and Os became more
familiar, Ss’ inferences for confirming cases
continued to correspond to the maximum
E(k). (For these cases, of course, recogni-
tion, assimilation, and identification all lead
to the same response.) For infirming cases,
however, Ss’ inferences changed from the
(incorrect) maximum E(k) to the correct
class as familiarity with Os increased.

For the test cases, on the other hand, rec-
ognition could not be used; Ss had never
seen the specific cue combinations which
comprised Os. However, Ss could use assimi-
lation for Os which were similar to Os ex-
perienced during training. And, indeed, it
was found that Ss used the identification
method, i.e., assigned a test O to the high
E(k) class, only when the test O was not
similar to any O in the training set. When
the test O’s cue values were similar to those
possessed by a familiar O (either confirming
or infirming), Ss’ inference for the test O’s
class was the one associated with the fa-
miliar O, i.e., the class was assimilated from
the familiar O.

These results have determined the form 
of the present theory. The test O results in- 
dicate that while Ss use cue probabilism for 
unfamiliar Os, they tend to depend heavily 
upon the similarity between previously ex-
perienced Os and the O for which the inference
is to be made in order to reach a decision
about the latter’s class. The infirming case 
results imply that before recognition or as- 
similation can take place it is necessary for 
the old O to have been experienced a num- 
ber of times and that before this familiarity 
is attained Ss utilize cue probabilism in or- 
der to make their inferences. 

Apparently Ss first attempt to infer an
O’s class by the recognition method, accept-
ing only familiar θs as possible candidates.
If no familiar θ is sufficiently similar to φ
to permit recognition, S resorts to assimila-
tion, still requiring that a familiar θ be the
basis of the assimilation. If there is still no
sufficiently familiar θ, S uses the identifica-
tion method which apparently utilizes every
θ in the cognitive space rather than just the
most frequently experienced.
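The order in which the three methods are tried can be written as a simple cascade. This is a deliberate simplification: Figures 2 and 3 describe probabilities of using each method, whereas the sketch uses hard cutoffs. The threshold names `critical_distance`, `assimilation_limit`, and `min_familiarity` are hypothetical parameters, not quantities given by the theory.

```python
def choose_method(distance, familiarity, critical_distance,
                  assimilation_limit, min_familiarity):
    """Pick an inference method for the present O.

    distance: distance between phi and the nearest familiar theta.
    familiarity: familiarity score of that theta.
    The three thresholds are illustrative cutoffs (assumptions).
    """
    if familiarity >= min_familiarity:
        if distance <= critical_distance:
            return "recognition"      # phi and theta taken to be the same O
        if distance <= assimilation_limit:
            return "assimilation"     # distinct O, but theta's class is adopted
    # No sufficiently familiar and similar theta: use all thetas (Equation 1).
    return "identification"

cutoffs = dict(critical_distance=1.0, assimilation_limit=3.0, min_familiarity=5)
near_familiar = choose_method(0.2, 10, **cutoffs)
mid_familiar = choose_method(2.0, 10, **cutoffs)
mid_unfamiliar = choose_method(2.0, 1, **cutoffs)
far_familiar = choose_method(5.0, 10, **cutoffs)
```

A probabilistic version would replace the cutoffs with curves like those in Figures 2 and 3; the ordering of the checks would stay the same.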


INFERENCE OF CUE VALUES 


When an O is encountered, it can be con- 
sidered the possessor of two kinds of cue 
values, known and unknown. The known 
cues include all of the immediately appar- 
ent aspects of O (which are used to make 
inferences at the lower levels of the cue di- 
mension hierarchy and which form the basis 
for the inference about O’s class) and all of 
the cue values which the cue function as-
signs to O’s θ in the cognitive space. All
others are designated unknown cue values. 
The most important unknown cue values 
are the higher order ones upon which Ss’ 
interactions with Os are based, e.g., intelli- 
gence, value, use, and friendliness. These 
higher order cue values, more so than an O’s 
class, are the end point in S’s search for 
knowledge about O. It is only by knowing 
these attributes of O that S can properly 
plan his interaction and attain his long-
range goals.


Recognition and Cue Values


Just as recognition of an O reveals its 
class membership, so too does it reveal 
many of O’s cue values which, while known 
from past experience, are not apparent at 
the moment. These unknown cue values are 
associated with θ in the cognitive space and
are available for use when the proper θ is
located. In this way recognition permits the
utilization of past experience in the plan-
ning and execution of interaction with O.
The cue values associated with θ are, how-
ever, only those which past experience has
provided. If other, as yet unknown, cue val- 
ues are required for the interaction, S must 
fall back on either assimilation or identi- 
fication in order to infer them. 


Assimilation and Cue Values 


When an O is not recognized, or when spe- 
cific attributes of a recognized O have never 
been experienced, S can utilize assimilation 
to provide the unknown cue values. The 
procedure is the same as for an assimilation 


of classes; the needed cue values can be as-
similated from the most similar θ in the
cognitive space. 


Identification and Cue Values 


When neither recognition nor assimilation 
provides the required cue values for an O, 
they can still be inferred in much the same 
way that classes are identified. When the 
cue inference is begun O’s class is already 
known through recognition, assimilation, or 
identification. Moreover, some of its lower 
order cue values are known and sometimes, 
if it has been recognized or if cue values 
from other θs have been assimilated, some
of its higher order cue values are also 
known. Knowledge of the class together 
with each of the known cue values provides 
the foundation for inferring the necessary 
cue values. 

In terms of the cognitive space, O pos-
sesses both known cue values and a known
class; the goal is to infer a cue value on a
specific cue dimension for which the value
is unknown. The reasoning underlying this
inference is parallel to that upon which the
class identification method is based. Here,
however, S need not deal with all θs in his
cognitive space. He can begin by consider-
ing only θs that belong to O’s class as rele-
vant to the inference. Then each of O’s cue
values defines a relative frequency distribu-
tion upon the unknown cue dimension. This
distribution yields the proportion of θs (a)
that belong to O’s class, k; (b) that possess
the same cue value as O possesses, d; and
(c) that possess each of the possible values
of the unknown cue dimension, c = 1
through 7. This proportion, P(c/k,d), indi-
cates the likelihood that a value c on the un-
known cue dimension is the best bet for the
inference given O’s class. The cue value as-
sociated with the highest proportion is the
best bet for the inference about O’s un-
known cue value.

When, as is usually the case, S knows 
more than one of O’s cue values there may 
be conflicting inferences yielded by the dif- 
ferent distributions. As in the inference of 
classes it is possible to compromise among 
these conflicting inferences by summing, for 
each unknown cue value, the proportions 
contributed by each known cue value (keeping
in mind that each proportion is only
1/nth of the evidence for that specific value
of the unknown dimension).


E(c) = \sum_{d=1}^{n} \frac{P(c/k,d)}{n}        [2a]

where

c = a cue value under consideration as
O’s unknown cue; c = 1 through 7

d = a dimension upon which O’s cue 
value is known; d = 1 through n 

k = O’s class 

n = the total number of dimensions upon 
which O’s cue values are known 

E(c) is the total evidence indicating that 
each cue value, c, from the unknown cue 
dimension is the best bet for O’s cue value. 
The S’s inference should be the cue value 
associated with the highest value of E(c). 
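
The summation in Equation 2a can be sketched as follows. This is a minimal illustration, assuming that each known dimension d has already yielded a distribution P(c/k,d) stored as a dictionary; the function name and the numerical values are hypothetical.

```python
def evidence_2a(distributions):
    """Equation 2a: E(c) = sum over d of P(c/k,d) / n.

    `distributions` holds one dict per known dimension d, each mapping
    a candidate value c to P(c/k,d). A value absent from a dict is
    treated as having proportion zero on that dimension.
    """
    n = len(distributions)
    candidates = set().union(*distributions)
    return {c: sum(p.get(c, 0.0) for p in distributions) / n for c in candidates}


# Two known cue dimensions that favor different values (made-up numbers).
p_given_d1 = {3: 0.6, 5: 0.4}
p_given_d2 = {3: 0.2, 5: 0.8}

e = evidence_2a([p_given_d1, p_given_d2])
inference = max(e, key=e.get)  # the value with the highest E(c)
```

Here the two distributions conflict, and the compromise favors value 5, whose summed evidence is larger.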

Lower Level Cue Inferences. For infer- 
ences of the lower level cue values, those in- 
ferred from sensation and used to infer 
classes, Equation 2a must be modified. At 
this stage S knows only the very basic sensation
cues (which are multivalued dimensions
at the level of the nerve if not at the
level of the neuron) and perhaps a few
other lower level cue values which have al- 
ready been inferred. The class is not known 
and therefore it must be dropped from 
Equation 2a, which becomes: 


E(c) = \sum_{d=1}^{n} \frac{P(c/d)}{n}        [2b]


Through the inference process described
by Equation 2b, O’s basic cue values are
built up and stored in the cognitive space
in terms of the location of φ. After enough
values are known, or when no more are
forthcoming, O’s class is determined through
recognition, assimilation, or identification.
Then, higher order cues can be inferred
through their association with the recognized
θ, through their assimilation from a
similar θ, or, now that the class is known,
through identification inferences and
Equation 2a.
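
Equation 2b differs from Equation 2a only in that the class restriction is dropped. A minimal sketch, assuming stored objects are flat dictionaries of cue values (the dimension names and numbers are invented for illustration):

```python
from collections import Counter


def evidence_2b(thetas, known, unknown_dim):
    """Equation 2b: E(c) = sum over d of P(c/d) / n, with no class term.

    `known` maps each known dimension d to O's value on it. A dimension
    with no matching stored objects contributes no evidence.
    """
    n = len(known)
    evidence = Counter()
    for d, d_value in known.items():
        matches = [
            t[unknown_dim]
            for t in thetas
            if t.get(d) == d_value and unknown_dim in t
        ]
        for c, count in Counter(matches).items():
            evidence[c] += count / len(matches) / n
    return dict(evidence)


# Made-up stored objects described on three lower level dimensions.
thetas = [
    {"hue": 1, "edge": 2, "shape": 4},
    {"hue": 1, "edge": 3, "shape": 4},
    {"hue": 1, "edge": 2, "shape": 6},
    {"hue": 2, "edge": 3, "shape": 6},
]

e = evidence_2b(thetas, {"hue": 1, "edge": 2}, "shape")
inference = max(e, key=e.get)
```

With these values, hue gives proportions of 2/3 and 1/3 and edge gives 1/2 and 1/2, so shape 4 carries the larger total evidence.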

Relevant Research. When an O is recog- 
nized, its cue values are immediately re- 
vealed for a large number of cue dimensions. 
This is a familiar and obvious happening
which is exactly what we mean in the everyday
use of the word “recognize.” For assimilation
and identification, however, the
cue inference process is less apparent and,
as yet, little appropriate research has been
done.

Some light is cast upon the assimilation 
and identification process by data obtained 
by Beach (1961). This research followed 
the same plan as that described in the pre- 
ceding monograph. The Ss were trained on 
a deck of cards and then were required to 
make inferences about the values of cues 
which were missing from test cards. In most 
cases, the inferences about the unknown cue 
values were dictated by assimilation; the 
cue value possessed by a training O which 
possessed cues similar to the test O’s known 
cues was given as the test O’s missing cue. 
In some cases, however, identification 
played a role and the inference was pre- 
dicted by Equation 2a. Which method best 
predicted Ss’ inference for a given test O 
appeared to be governed by the degree of 
dissimilarity between the test O and the 
most similar O from the training set. While 
sketchy, these results imply that assimila- 
tion and identification play a role in Ss’ in- 
ferences of unknown cue values. 

There are two other studies which are 
relevant to the role of the identification 
process in cue inference. In the first of 
these (Goodnow, 1954) Ss were taught to 
classify Os on the basis of three dichotomous 
cue dimensions and then were shown test 
Os with one of the cues missing and were 
asked to infer the missing value. It was 
found that Os’ known cues first were used 
to determine their classes and then the 
classes were used to determine the infer- 
ences about the missing cues. Unfortu- 
nately, the results do not yield information 
about whether or not Os’ known cues influ- 
enced the inference in the manner described 
by Equation 2a or whether the inferences 
were determined by the class alone. Nevertheless,
these results indicate that inference
of O’s class is a first step in the inference
of unknown cue values and that these unknown
cue inferences are influenced by S’s
knowledge of O’s class membership.

⁶ Note that both the class and cue inference
equations in Beach (1961) have been rejected in
favor of those given above.

In the second study (Bolles & Bailey, 
1956) five Ss were told 54 Os’ class mem- 
berships together with some of their cue 
values and asked to infer Os’ sizes. The 
average correlation between the inferences 
and Os’ actual sizes was .994. Moreover, Ss’ 
errors were larger for the less familiar 
classes of Os and for the classes which had 
large size variance among their members. 
Bolles and Bailey interpret these results as 
showing the importance of prior knowledge 
of O’s class upon unknown cue inferences. 
The additional cues given about Os are con- 
founded, however, with the knowledge of the 
class. Consequently, the relative roles of 
these two factors are not known, and it is 
impossible to tell whether or not Equation 
2a could account for the results. 

The results of these three studies are 
much too vague to be seriously considered 
as evidence for the theory. At present, re- 
search is being undertaken which will at- 
tempt to examine the cue inference proc- 
esses more thoroughly. 


INFERENTIAL ACCURACY 


Revision Process 


It is important for S’s inferences about
an O to be as accurate as possible. This is
true because the inferences must stand up
when they are used in interactions with O
and because they become associated with θ
in the cognitive space and form the basis for
inferences (through assimilation and
identification) about other Os. To assure the
utmost accuracy for the class and cue values
associated with a θ, each inference is
constantly being re-evaluated in light of
information gained in the course of interaction
with O.

The revision process consists of utilizing 
environmentally derived cues to reinfer 
each attribute of θ by the identification
method. The new inference is then com- 
pared with the old one, no matter which 
method originally produced the latter. 

Affirmation of the Old Inference. When
the old and the new inferences agree,
indicating the stability of O’s attribute and
the accuracy of the previous inference, the
inference is retained as an attribute of O
and the fact that it was affirmed is recorded
for future reference. The recording of a class
or cue value’s affirmation is accomplished
by affirmation frequency distributions on
the set of classes at each level in the
hierarchy and upon the sets of cue values from
each dimension in the cognitive space. These
distributions record the frequency with
which each class and each cue value has
been experienced as an attribute of θ.
Affirmation frequency is an important concept
because it serves to tie together many of the
previously introduced mechanisms. Therefore,
before proceeding with the discussion
of its role in revision let us look at its
relationship to these other parts of the theory.

Affirmation frequency, through the appropriate
functions, plays a role in the assignment
of classes and cue values to θs in
the space. As was discussed earlier (see sections
on Class Hierarchy and Cue Function),
assignment is dependent partially upon the
relative frequency with which the classes or
cues have been associated with θ. These
relative frequencies are derived from the
affirmation frequencies for the appropriate
set of classes or cues. Also, in the discussion
of the factors which influence the selection
of a θ for recognition or assimilation (see
p. 26) it was stated that one important
determinant was S’s familiarity with θ.
Familiarity is determined by the sum of the
affirmation frequencies across the set of all
classes together with the total number of
cue values known for θ (from the functions
which assign the last associated cue value to
θ). In this way affirmation frequency plays
a role in determining which inference
method will be used to infer an O’s class.

Revision of the Old Inference. When a
θ’s attribute is re-evaluated and affirmed, it
is retained as one of the classes or cue values
and its frequency of affirmation is increased.
If, however, the new inference disagrees
with the old one, a decision must be made
about which is correct. If it is decided that
the old inference is incorrect, another tally
must be made for the affirmation distribution
on the appropriate set of classes or cue
values, and the function which records the
last associated class or cue value must be
revised. When the latter change is made for
cue values, the result is a change in θ’s
location in the cognitive space.⁷

The revision decision is made by comparing
two factors, the frequency with which
the old inference has been affirmed and the
magnitude of the E(k) or E(c) for the new
inference. Obviously, if the old inference
has been frequently affirmed, it is not prudent
to replace it with the new inference unless
the latter is shown to be very probable
in light of new information about θ. On the
other hand, an old value which has been
affirmed only a few times is not necessarily
a better choice than the new inference.


Inferential Confidence 


To compare frequency of affirmation and 
the magnitude of E(k) or E(c) it is neces- 
sary to introduce another concept into which 
they both can be transformed. This concept 
is degree of inferential confidence. It is posi- 
tively related to both frequency of affirma- 
tion and to magnitude of E(k) and E(c). 
The inference with the higher confidence 
value is the one assumed to be correct for θ.
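
The revision decision can be sketched as a comparison on a common confidence scale. The theory states only that confidence is positively related to affirmation frequency and to the magnitude of E; the two transformation functions below, the `half_point` constant, and the tally dictionary are assumptions chosen purely for illustration.

```python
def confidence_from_affirmations(freq, half_point=3.0):
    # ASSUMPTION: a saturating, increasing function of affirmation
    # frequency. The source requires only a positive relationship.
    return freq / (freq + half_point)


def confidence_from_evidence(e_value):
    # ASSUMPTION: E(k) or E(c) is used directly as the confidence value.
    return e_value


def revise(current, tallies, new_value, new_evidence):
    """Return the attribute value retained after re-evaluation.

    `tallies` maps each value to its affirmation frequency and is
    updated in place when a value is affirmed or newly adopted.
    """
    if new_value == current:
        tallies[current] = tallies.get(current, 0) + 1  # affirmation
        return current
    old_conf = confidence_from_affirmations(tallies.get(current, 0))
    new_conf = confidence_from_evidence(new_evidence)
    if new_conf > old_conf:
        tallies[new_value] = tallies.get(new_value, 0) + 1  # revision
        return new_value
    return current  # the old inference stands


well_established = {"A": 9}
kept = revise("A", well_established, "B", 0.6)

weakly_established = {"A": 1}
replaced = revise("A", weakly_established, "B", 0.6)
```

Under these assumed functions a value affirmed nine times survives a conflicting inference with E = .60, while a value affirmed once is replaced by it.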

⁷ In light of the present analysis it must be
assumed that Ss in the experiment reported in the
previous monograph made two inferences for the
cards. The first inference utilized cues on the faces
of the cards and resulted in correct inferences for
the confirming cards and in incorrect inferences for
infirming cards. The second inference used these
cues and the answer given by the experimenter,
which, by definition, was a perfectly valid cue with
a P(k/d) = 1.00 for the correct class. For the
infirming cards the second inference disagreed with
the first and thus, through revision, replaced the
former as the card’s class in S’s cognitive space.
However, until familiarity with the cards had
progressed to the point that he could utilize
recognition, S continued to give the incorrect inference
based on the cues on the faces of the infirming
cards. As he began to utilize recognition, the
inferences were the classes stored in the cognitive
space and, for infirming cards, ceased to be the high
E(k) class and began to be the correct class.

Measurement of Inferential Confidence.
Like affirmation, the concept of inferential
confidence ties together many of the mechanisms
which have been introduced throughout
the previous discussion. It is defined as
the degree to which S believes that a decision
or an inference is accurate, and it can
be measured in a number of ways. The most
candid method, and one with apparently
high validity, is merely to ask S how con- 
fident he feels about a decision or inference 
(see, for example, Anderson & Walen, 1960; 
Irwin, 1953; Pollack & Decker, 1958). A 
more complex method is to ask S to place 
bets on his decisions or inferences (Ed- 
wards, 1961). Another method, which to the 
author’s knowledge has never been used, is 
to utilize the degree of specificity or gen- 
erality in S’s interactions with an O to de- 
termine his confidence in the inferences 
upon which the interaction is based. If S is 
confident about the accuracy of his infer- 
ences about an O, those aspects of the in- 
teraction which are based on the inferred 
attributes should reflect the confidence. In 
this situation, high inferential confidence 
should be revealed in the form of fairly spe- 
cific predictions about O and O’s reaction to 
S’s behavior toward it. If, on the other hand, 
S lacks confidence in his inferences about 
the relevant attributes of O, he can make 
more cautious predictions about O. Less
specific, general predictions will permit
interaction (and thus new knowledge with
which to re-evaluate the questionable
inferences) without jeopardizing S’s goals
through a faux pas. This sort of behavior is 
frequent in social situations in which S’s 
interaction with O is dictated solely by so- 
cial form until he has an opportunity to 
“size-up” O. Proper examination of the gen- 
erality and specificity of behavioral inter- 
actions with Os could possibly lead to a 
method of estimating S’s confidence in the 
inferences underlying the interaction. 

Inferential Confidence and Inference
Method. Inferential confidence is essentially
a scale of subjective probabilities about the
correctness of an inference. Each inference
is associated with a value from this scale,
and the critical value for acceptance or
rejection of the inference is determined by
S’s motivation to be correct. Thus, for example,
in Figure 2 the critical distance is
the degree of difference between θ and φ beyond
which S’s inferential confidence is too
low to accept that they derive from the
same O. Similarly, the rejection of assimilation
and the switch to identification is governed
by S’s confidence about assimilation
inferences when large distances exist between
θ and φ or when the frequency of contact
with θ is small.

Inferential Confidence and the Class Hierarchy.
Inferential confidence also plays
an important role in the selection of the
level in the class hierarchy where an O’s
class will be inferred. At the lower levels of
the hierarchy the classes are more tightly
defined and contain very few θs. As a result,
assignment of Os to classes on these levels
runs a good chance of error. However, if
the E(k) for the most likely class on one
level is not large enough to warrant sufficient
inferential confidence about being correct,
S can move up the hierarchy until a
satisfactory level is found. While S’s motivation
to make a correct class inference
will tend to make him select a high level in
the hierarchy at which to work, his motivation
to be correct on subsequent cue inferences
will tend to make him select a low
level. The hierarchy level at which the class
inference is finally made will be where S’s
degree of confidence for the class inference
balances the degree of confidence for subsequent
cue inferences (cf. section on Selection
of a Hierarchy Level).
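
The rule of moving up the hierarchy until E(k) is large enough can be sketched as below. This covers only the upward search, not the balancing against subsequent cue inferences; the `required` threshold, the level names, and the E(k) values are hypothetical.

```python
def select_level(levels, required):
    """Return (level name, best class) for the lowest satisfactory level.

    `levels` is ordered from the lowest (most specific) level upward;
    each entry pairs a level name with a dict of E(k) values. Returns
    None when no level reaches the required magnitude.
    """
    for name, evidence in levels:
        best = max(evidence, key=evidence.get)
        if evidence[best] >= required:
            return name, best
    return None


# Made-up E(k) values for an aviator's hierarchy of aircraft classes.
levels = [
    ("engine type", {"reciprocating": 0.45, "turboprop": 0.40, "jet": 0.15}),
    ("propulsion", {"prop": 0.85, "jet": 0.15}),
    ("vehicle", {"aircraft": 1.00}),
]

choice = select_level(levels, required=0.80)
```

With a required value of .80 the lowest level is rejected, since its best class reaches only .45, and the inference is made one level higher.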

As these examples indicate, inferential 
confidence provides a common link for 
many of the concepts introduced throughout 
the theory. In addition to all of this, how- 
ever, inferential confidence also governs an- 
other aspect of S’s inference behavior—the 
search for new information about O. 


Cue Search 


When inferential confidence is below the 
degree required by the existing level of 
motivation and the inference is assumed to 
be of questionable validity, some sort of 
action is required to remedy the situation. 
The action usually consists of a search for 
further information upon which the infer- 
ence can be re-evaluated and either af- 
firmed or revised. This search for informa- 
tion, called cue search (Bruner, Goodnow, 
& Austin, 1956), may be either active prob- 
ing of the environment through questions, 
manipulation, reading, or the like, or it may 
be passive reception of information in the 
course of cautious interaction with O. In 
either case, it consists of behavior in which 


S obtains totally new information about an 
O or in which he again receives information 
he already possessed about O. The former is 
used to expand his knowledge about O and 
to re-evaluate, and perhaps revise, previous 
inferences. The latter adds to the various 
affirmation frequencies and raises S’s infer- 
ential confidence about those particular at- 
tributes of the O. 

If S is not wholly confident that he recognizes
an O, he can search for information
about O. In this situation cue search will be
initiated when the distance between θ and
φ is very near the critical distance. If O is
indeed familiar to S, the additional information
will usually decrease the size of the
distance and thereby permit recognition.
If it is unfamiliar, the distance will usually
increase and O will not be recognized, and
either assimilation or identification will be
used to make inferences about O.

Inferences based upon identification will 
lead to cue search if no E(k) or E(c) is 
sufficiently high to warrant confidence in 
the accuracy of the inference. Indeed, when- 
ever a number of inference alternatives are 
approximately equally likely, whether they 
are based on recognition, assimilation, or 
identification, cue search will usually be 
undertaken to break the deadlock. 
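
The two conditions that lead to cue search (no alternative with sufficiently high evidence, or several alternatives approximately equally likely) can be sketched as a simple test. The `required` and `margin` parameters stand in for S's motivation-dependent criterion and are assumptions, as are the example values.

```python
def needs_cue_search(evidence, required=0.5, margin=0.1):
    """Decide whether further information should be sought.

    `evidence` maps each alternative to its E(k) or E(c). Search is
    called for when the best alternative falls below `required`, or
    when the two best alternatives differ by less than `margin`.
    """
    ranked = sorted(evidence.values(), reverse=True)
    if not ranked or ranked[0] < required:
        return True  # nothing warrants confidence
    if len(ranked) > 1 and ranked[0] - ranked[1] < margin:
        return True  # deadlock between alternatives
    return False


clear_case = needs_cue_search({1: 0.70, 2: 0.20, 3: 0.10})
weak_case = needs_cue_search({1: 0.40, 2: 0.35, 3: 0.25})
deadlock = needs_cue_search({1: 0.52, 2: 0.48})
```

Only the first case permits an inference without further search; the second lacks any sufficiently high value and the third is a near tie.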

The long run importance of cue search,
and the revision which results from it, is to
provide S with accurate and reliable knowledge
about Os with which he interacts. This
not only assures more successful interactions
with these Os but it also provides a
solid foundation of knowledge for inferences
about Os which are encountered for the first
time. Although Irwin, Smith, and Mayfield
(1956) found that Ss’ inferential confidence
was greater when they possessed more information
about an event than when they
possessed less, a quantitative statement of
the relationship is not available. It is clear,
however, that because they are measurable,
these two concepts are important keys to
the investigation of the remainder of the
theory. By appropriate manipulation of motivation
and opportunity for cue search, as
well as by the proper selection of cue values
associated with experimentally controlled
Os, it should be possible to learn about the
empirical relationships which are assumed
to exist between these and the other theoretical
concepts.


OVERVIEW AND DISCUSSION


The major points of the theory and the 
hypothesized temporal relationships are 
outlined in Figure 4. 

First, O is encountered in the environment
and at the same time the class hierarchy
level is determined on the basis of S’s goals,
his motivation to make a correct class inference
and subsequent cue inferences, and
the context of the encounter. Next, through
Equation 2b, O’s immediately apparent
attributes establish its cue values on the cue
dimensions. These cue values define the
position of φ within the cognitive space. The
φ is compared to θs, and a decision is made
about whether or not it is similar enough to
any θ to permit the assumption that they
both derive from the same O. If the similarity
is sufficient, the recognition method
is adopted. If similarity is high but is not
sufficient to warrant recognition, the
assimilation method is adopted. If the similarity
between φ and each θ is quite low the
identification method is used.
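
The choice among the three inference methods can be sketched as a comparison of distances in the cognitive space. The use of Euclidean distance, the two threshold values, and the stored coordinates are assumptions made for this illustration; the passage above states only that the choice depends on degree of similarity.

```python
import math


def choose_method(phi, thetas, critical=1.0, assimilation_limit=3.0):
    """Return (method, nearest stored object or None).

    `phi` is the tuple of cue values for the present O; `thetas` maps
    a label to the cue values of each stored object. Distance is
    Euclidean here (an assumption, see lead-in).
    """
    if not thetas:
        return "identification", None
    nearest = min(thetas, key=lambda name: math.dist(phi, thetas[name]))
    distance = math.dist(phi, thetas[nearest])
    if distance <= critical:
        return "recognition", nearest
    if distance <= assimilation_limit:
        return "assimilation", nearest
    return "identification", None


# Made-up cognitive space with two stored objects on three dimensions.
stored = {"theta_1": (1, 1, 1), "theta_2": (5, 5, 5)}

recognized = choose_method((1, 1, 1.5), stored)
assimilated = choose_method((2, 2, 2), stored)
identified = choose_method((9, 9, 9), stored)
```

The first object lies within the critical distance of a stored object, the second is near but not near enough, and the third resembles nothing in the space.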

Given the inference method and the hierarchy
level, the next step is to infer O’s
class. For recognition and assimilation this
is accomplished through application of the
class function; for identification Equation 1
is applied. If there is no class to assign to
the recognized or assimilated θ, S can change
hierarchy levels, assign a class, and then
make intrahierarchy inferences to the previously
desired level. If this roundabout
method does not lead to sufficient inferential
confidence S can utilize the identification
method for θ. If the identification method
does not produce an E(k) for one class
which is sufficiently greater than the E(k)s
for the other classes, S must search for more
information about O.

Fig. 4. Diagram of the class and cue inference process.

After O’s class is determined S can move
on to inferences about its unknown cue values.
If O is recognized, or if its class is assimilated,
θ may have the desired cue value
associated with it. If there is no value associated
with the selected θ, or if identification
has been used to determine O’s class, the
unknown cue values can be inferred through
Equation 2a. If the identification method
fails to provide a definitive answer for the
cue inference, then S must search for further
information about O.

The theory is an attempt to build upon 
the research results reported in the previous 
monograph and to fit these results into a 
larger conceptual framework. However, 
care has been taken to make the general 
approach conform to a common sense view 
of how inference behavior takes place. 

The specific roles of recognition, assimi- 
lation, and identification in the attaining 
of knowledge about Os, and the effects of 
these inference methods upon subsequent 
inferences, fit well with common-sense no- 
tions about cognitive functioning. Indeed, 
recognition is the equating of a present O 
with a previously experienced one, and the 
common behavioral pattern is to utilize 
previously obtained knowledge in inter- 
actions with O. This is precisely what we 
mean when we say we recognize our car 
(“the carburetor needs adjusting”), or our 
boss (“he prefers to be called J. R.”), or 
our house (“I can walk in without ringing 
the bell”). 

The process which we have called assimilation
is also commonly experienced. The
techniques used to operate one typewriter
work for a similar one even though the two
are not identical; behavior toward a person
perceived as highly similar to a person
whom you know well tends to be a cautious
version of the behavior which would be appropriate
to the latter, etc. The identification
process takes care of the leftover Os,
those which are too dissimilar to any θs to
be either recognized or assimilated. Whether
or not the specific mechanisms proposed in
the theory are entirely correct, these three
kinds of inferences make sense and apparently
cover the field.

The cue and class hierarchies are also
common sense. When it is recognized that
Os must be “reconstructed” from sense
data, so to speak, the hierarchical structure
of the cues becomes a necessity. It is clear
that there are very real differences among
cues; some are closely akin to sensory
events while others are more abstract.
Granted that cues all derive from sensation
rather than from innate ideas, the abstract
cues must be generated by the sensory
events, and they in turn generate other
knowledge about Os. It is equally clear that
what an O is influences our interpretation
of its properties; therefore, the O’s class
must be known before many of its attributes
can be properly evaluated.

The class hierarchy is common sense if 
we realize that every class of Os can be sub- 
divided by segregating its members accord- 
ing to specified similarities among their 
properties and giving the resulting sub- 
groups new generic names. Assuming that 
all classes result from this process of dif- 
ferentiation and renaming, it is easy to see 
how a class hierarchy comes to exist as S 
has increased contact with Os and is re- 
quired to be increasingly discriminate and 
correct in his interactions with them. For 
example, the broad class “aircraft” becomes 
increasingly differentiated by an aviator as 
he gains more and more experience with 
them. Aircraft becomes either “jets” or 
“props,” and the props are differentiated 
into “reciprocating engines” and “turbo- 
props.” These classes are further subdivided 
into smaller classes which are also sub- 
divided, etc. The result is a hierarchy which 
extends from aircraft (which is itself sub- 
sumed under higher order classes) down to 
the particular planes with which the aviator 
has had personal experience.

Another common aspect of our experi- 
ence with Os is their apparent individuality. 
Even though they are usually similar to 
other Os, we experience them as unique in- 
dividuals which differ, no matter how 
slightly, from their fellows. In psychological 
theorizing, on the other hand, there is a
tendency to forget this fact. In the present 
theory, an effort has been made to acknowl- 
edge and to utilize an O’s individuality as 
an important part of the inference process. 

Individuality is reflected in recognition 
and in its temporal priority in the class in- 
ference process. Recognition is the quin- 
tessence of dependence upon an O’s individ- 
uality and constancy over time. Similarly, 
cue inference through the identification 
process emphasizes O’s individuality by 
stressing the role played both by O’s class 
and by its unique cue values. The O is not
merely relegated to a class and, once there,
merged into the mass of θs which are its
members. Rather, inferences about its cues 
are based on only those fellow class mem- 
bers which share its unique cue values. 
The integrity of the individual O also is 
reflected in the theory’s emphasis on affir- 
mation and revision of knowledge about 
each of them. In the course of interaction 
with an O its eccentricities and peculiarities 
become known and old incorrect knowledge 
is revised. In this way S keeps abreast of 
changes in Os which populate his environ- 
ment and, consequently, is capable of con- 
ducting successful interactions with both 
the old and the new Os which he encounters. 